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THE INDEX OF COINQDENCE AND ITS APPUCATIONS IN CRYPTANALYSIS > 



INTBODUCnON 

Frequency tables in tlio analysis and solution of dphers have commonly been employed 
to make ussiimptions of plain-text o<iuiviilcnta for tbo cipher letters constituting a message. 
The signiiictinco of the viiiioiis phases of the curves themselves, i.e., the crests and troughs 
iwid their relative positions in such frequency tables, has been recognized to some extent, but 
liitgi'iy only in connection witli the deteriniiiation of two more or leas preliminBjy points in 
their analysis: (1) whether the frequency distribution approxunates that of a sulMtitution 
ciphor involving only one alphabet or more than one alplnabet; (2) whether this approximation 
etirri''‘{X>nds to that of a standard alphabet, direct or reversed, or that of a mixed alphabet. 

1 1 will be show'n in this paper tiiat tlio frequency tables of certain types of dpheia have 
k'l-i'xiito i*h.iractcristics of a mathematical or ratlior statistical nature, approaching more or less 
>-i...'i'ly tlioso of ordinary statistical curves. ThoHa choructeristics may bo used in the solution 
itf aiich ciphers to the exclusion of any analysis of the frequencies of individual letters consti- 
..(I'iiig the tables or curves, and without any asaumptuma whaUoer qf 2 doi»-tetf valuator (ha 
Utten. 

U is true that cipher qrstoms admitting of such treatment are not very commonly encoun- 
tcrcvl. But inasmuch as such systems are always of a complex nature, which the ordinary 
mctlutds of cxyptanalysis would find rather baffling, a description of a purdy mathematical 
.iiuilysis that may be applied to other cases similar to the ones herein described may be con- 
•iidci-ed valuable. In fact, it is poesible that the principloe to be set forth may find con- 
^i(!c^!.uly wider application in other phases of exyptarndysis than is apparent at this time. 

Two examples of such a treatment will bo given in detail: One dealing with o-eubstitution 
cipli.-'r wherein a series of messoges employing as many os 125 random mixed secondary. alpho- 

<-an be solved without assuming a plain-text value for a single cipher letter; the otlier a 
:miliiplo alphabet, combined substitution-transposition oipher, solved from a single message 
of fair length. 

> llic present paper woe preporad In 1S28. It is a ravialon of an caiUor paper with the eaiae title, published 
in 1922 by the Rlvcrbank Laboratories, Geneva, 111. The author takce this opportunity to thank CbL George 
rahyati, of the lUverbank Laboratories, for his eourtesy in granting ponalsshm to publlih this revision. 
.Mihokigh ixiltcr methods have Im»u olaboratod since tbo revision was prepared, It hae been doomed of interest 
hiHliirically to publisii this paper in Ita 1023 form without change. 
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.THE VOGEL QUINTUPLE DISK CIPHEB > 

This (uplior systom involves tlio use of ilvo superimposed disks bearing dissimilar random 
mixed alphabets. These disks are mounted u]>on a cireiilur base plate, Uie periphery of wbieii 
is divided into 26 segments; one of those is marked “Plain”, indicating the segment in lino 
with which the succ.cssivo letters of tlie plain te.\t ns found on the five revolving disks are to 
bo brought for ciiciphcrraciit. The romnining 25 sogmcjits of lltc base plate bear the numbers 
from 1 to 25 in a mixed sequence, whicJli \ve have culled the “Numerical key.” This key may, 
however, consist of less than 25 numbers, in which case one or more of the segments of the base 
plate will remain blank. The numbera constituting tlie key are written on the'bose plate in a 
clockwise direction beginning immediately at tlie right of the plun segment (fig. 1). 

' Method of Ekcifhetcment 

In tlic accompanying example illustrating tlie detuls of encipherment it will be soon that tlie 
numerical key consists of 21 numbers, leaving blank, therefore, the four segments immediately 
preceding the plain s^ment.* ' Assuming a series of messages, let us suppose the first three 
.begin os follows: 

1. Prepare for bombardment at Harvey . , . 

2. Enemy attack on Huntorstown ... 

3. Second Field Artilleiy Brigade i . . 

Revolving the five cipher disks successively, and thus brining the first 5 letters of message 
1, PREPA, in line with the plain segment, reading from tlie outer disk inward in the order 1-2-3- 
t-5, the cipher letters for this first set of 5 plain-text letters ore then taken in the same order from 
the segments of tlie.disksdirqct]yin linowiththatscgmentoftliebaseplate that bears thenumberl. 
In this case it is the eighteenth segment after the plain, in a clockwise direction, and, os sliown 
in figure 1, tlie equivalent cipher letters for tliis group are IIEKJR. The second sot of five plain- 
text letters of message 1, REPOR, are then in a similar manner sot in line under the plain segment, 
and their equivalent cipher letters ore taken from the augment immediately following segment 
1 of the numerical key, in a clockwise direction, viz, segment 13. The cipher letters in this case 
are VZQWH. The third group of letters in message 1 finds its cipher equivalents at segment 

■ W'hi'iC oil duty in the Code and Cipher Section of the Inlcllisonco Division of the General Staff, G.U.Q., 
A.E.F., Lt. Col. F. Mooniiaii, Chief of Section, turned over to the writer for study a cipher system to;(ci1icr 
with a scrii's of 20 test moBsoges submitted by Mr. E. J. Vogel, Chief Clerk, who luid taken considerable iiUcrcst 
in ory]ilu}';.i|iliy and had, us a result of his studies, dovised tlie systom presented for oxaiiiination. The writer 
workci! U]'»iii tlio cipher during his Icisiiro moments, but the prohlom involved coiisidorablo lalior and solution 
was ncib cmuplcicd Imforo licing relieved from duty at Umt station. The main prlnoiplcs for solution, however, 
were ostabisNiiod and only tlio ilclailod work remiiliicd to bo coinjiloted. After an interval of more than a year, 
while Din-iiiir of the Cijiher Department of the lUverbauk Ijulxiraturies, Genova, 111., the writer tunusd his 
attcniiuii miCC more to this cipher and suoooodod bi eomplctcly solving the problem by carrying out those 
principles lo their logical conclusion. 

* It is rceommondod that the reader prepare a duplicate of the set of disks in order that he may moro readily 
follow the various steps in the analysis. 
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IS; Mio foiii'ili, lit spj'jiipiii S. Tlic fifth Fcroiip of plnin-toxt lotten, however, will take its 
ciplicr cquivnlcntiii ilu’ /Irsf p^^niont to the. right of the plain segment, inasmuch as the 
srgmciirs inimcdiflrol.v sogmont S are Iilank. Plain-text letters TATHA, therefore, will 

be enciphered on segniem 0, licconiiiig XONJE. This method is continued injike manner through- 
out message 1. If message 1 contains more tlinn 21 groups of S lottei8,.the twenty-second group 
will foke its cipher equivalents on sqrmcnt I again; tlie twenty-third on segment IS^.and so on. * 







































Ficuns 1 



Pi-oeeeding now to lucssago 2, tho limt 5 plnin-toxt letters, ENEMY, ore set up under the plain 
segment anci the. cijilier letters arc taken from segment 2, becoming LTVBH. The second group 
of a letle-m is cneipliorcd on segment 1, the third group on segment 13, and so on, throughout 
tlio message. Tlio first 5 lotfora of message 3, SECON, are encipliered on segment 3, becoming 
YAPAC. 'i'lie first group of cipher lottora in ii message is always to bo token from the seginont 
bearing the uiimhcr which coincides with the serial or accession number of the xhessage in tho 
day's activity. ........ 

It will bo soon, therefore, tliat no matter what repetitions occur in the plain-text beginnings 
of messages, the cipher letters will give no evidences of such repetitions, for each message has a 
different starting point, determined, as said before, by the serial number of the message in the 
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day’s activity. This automatic prevention of initial repotataons in the cipher text is true, how- 
ever, only of .a number of messages equal to the iungth of the numerical key; for, with a number of 
messages greater than the lengtli of the key, the initial s(^;ments from which the cipher letters 
arc to he taken begin to repeat. In tliis case, message 22 would necessarily have the same initial 
point ns message 1 ; messoge 23, the same as message 2, and so on. Messages 43, 04, 85, , 

would all begin with segment 1 ; messages 44, 65, 86, . . . with segment 2, and so on. 

The secrecy of the Bystom is dependent upon a frequent change of alphabets and a still more 
frequent clinngo of numerical key, since it must bo assumed, as with all ripher systems or devices, 
that sooner, or later the general method of encipherment will become known to the enemy. 
The only reliance, therefore, for the safety of the messages must bo placed in keying the specifio 
alphabets and tlie numerical key for a given series of messages from the enemy. 

Prikcxplks of Solution 

The following mossoges * ore assumed to have been intercepted within one day and therefore 
to bo in the some alphabets and key: 

Mebsaoe No. 1 



ULVXK 


QNXVD 


GIRIE 


lUNEE 


FEXVP 


HPVZR 


UKSEK 


UVQCI 


VXSFW 


GVART 


YBZKJ 


WVUPV 


XZCBD 


BDOLS 


GHINZ 


LJCIE 


KSLPY 


VPBYD 


VTRJK 


BDDFA 


ANJZE 


XGHED 


ERYVP 


YPWDJ 


DFTJV 


ZHTVB 


RXIMF 


OZDOJ 






Message No. 2 








UUCY 


GXAEU 


DTEIL 


UZBRV 


GJZSS 


QILUOZ 


PTFOO 


NWSHD 


BPTJO 


HQRYY 


YAXRZ 


KTEUP 


UAYUK 


ISRDZ 


VUVKW 


HXAYD 


YAGSU 


CURBZ 


UBXOV 


EBBPl 




UHAXF 


ZSLXV 


QFXUE 


UPZUK 


MQZZT 


XUURV 


EJVB 






Message No. 3 








YLUKW 


CBGSF 


VGABP 


HOZFV 


QQNSQ 


NQLQL 


DIGXU 


XCWAI 


QFJOQ 


TYDEL 


MBMJB 


SEPSO 


OHREM 


ELKIP 


KXNtfff 


QYBIH 


BHFDC 


GLYWC 


YGMUP 


EEXZH 


UBBSB 


SBONG 


URQXW 


YRAYU 


NYUCS 


LNEUV 


VNSXN 


WGVME 


UXPDF 


WGTZE 


KRLGU 


ZJFZJ 


W 














Message No. 4 








vmjj 


LTUKJ 


PONFG 


RIUGG 


OZGWS 


UBNUW 


WGILB 


JNXTD 


BPREX 


UKWHB 


OBFVO 


TFGSJ 


SUEH 


RTZUI 


LLUUX 


FIFffC 


PGSBA 


XRCAS 


XWXQV 


SLDKS 


NTESD 


QVQBN 


RZDMB 


JQLYH 


LTUXB 


BSV?VL 


ILKYU 


NMFSB 


HB 




Message No. 5 








VJr.5Q 


KZJJK 


DCVQD 


KSSXY 


TSUXE 


GRROH 


PXKZF 


ZXFHS 


VGDVU 


X1..TL3W 


EX.HEF 


AVVQWF 


ZESMX 


GWCEM 


JPNVB 


GRJGB 


IBffOH 


YMOAP 


IZY-NX 


GX14SB 


OZZGK 


FVURN 


NJFGQ 


DTPLV 


STOID 


DffVLR 


TXTBH 


iVXWIE 


YJXXW 


BKOJQ 


FOUHO 


T 











* Thcao an the actual msBaages nibndtted by Mr. Vogel. 
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Messaoe No. 6 



XGORF 


GCHAX 


DUEIQ 


XAOVK 


BP2UH 


Siam 


XNEYZ 


JAKUK 


BEQVG 


IWTVU 


NPCLU 


KQDIG 


YLUNC 


KYJJF 


SDSTU 


DCBUX 


OQUVA 


RHSXE 


VGQSV 


OGHPI 


XCYIO 


SUAEU 


BQAPY 


REDinr 


FVGSQ 


HYURQ 


LPUVN 


LNGQT 


IPJRI 


HUHAO 


DZUTW 


BFUO 
















Message No. 7 








UPUCL 


HJYDY 


ZTHKA 


NWJWJ 


IFUZI 


VZIJB 


QODUT 


HZPRX 


SWNFC 


DKXXR 


TOSVL 


UNHNO 


RZRTX 


QIPFN 


HOUNL 


FGUVO 


TPOWI 


VHNCQ 


RTDCO 


LNTTM 


IfXMXR 


TUIIG 


ZOJGU 


BTXJK 


KGUEJ 


USJBS 


ESVWJ 


IKGSL 


APA 




Message No. 8 








NWWVY 


UMHVB 


RPQHF 


XOHQN 


IPATI 


CFUZT 


DIUQI 


IRUJI 


NSBffR 


VTEGJ 


lAFEO 


BUUUT 


VPSKO 


HUYNA 


VPRXS 


SCUZB 


JHGDE 


•WOHJV 


6XHRP 


OIHWF 


LBTXF 


QESIO 


YXVNK 


AWDAA 


EQLVJ 


EYYTH 


GSEYA 


KLXHR 


NCVNY 


XSQRG 


XVZJC 


TJVSG 


UJEGZ 


TFONC 


HNPU 










Message No. 0 








NYMPE 


YZYRZ 


AVLPP 


lUPBB 


VffAXZ 


QSVFG 


ITZIIT 


KZNFC 


DHSUA 


MHCMP 


SOJKI 


GDPQZ 


IIPUV 


ZOGBO 


UCXXR 


SEPEU 


OTZMU 


AUHWX 


NDEUH 


YUEIP 


RFOHI 


QLQHG 


UFRT 


UTNQC 


JEGAF 


SQRWO 


DAJIIT 


YKXXQ 


VRQUW 




Message No. 10 








TEPDA 


XXHHC 


FYUFK 


QRBJV 


YVJID 


JBNXF 


JBLXU 


XSOXR 


XNHJK 


UFRXL 


WELQJ 


QJXFV 


RLSSF 


BQJWR 


BZXYN 


EAUWP 


AYSKO 


UWJDK 


GQRVff 


ZVXXH 


NZHGV 


SVEYH 


NEANW 


G 














Message No. 11 








SEYBZ 


MGSOZ 


dlPSQ 


BASFH 


VFSCG 


CHKSB 


ZOPRZ 


CZEVH 


OCADC 


LGRXO 


XXCKK 


UVQGJ 


XYUOG 


FFFVJ 


OZCJE 


AQSJY 


WZCIIO 


TGXVM 


XEXYA 


woia 


GTCWT 


TGLOI 


IWORT 


JJVQB 


HYMK 












Message No. 12 








OMIXX 


WUXZE 


YEHOJ 


ALJCO 


EPLPJ 


RBCVX 


WARW 


DYBDQ 


FTZDA 


XEUJG 


ROHUP 


OFSGQ 


PONLW 


RAIBP 


XACIB 


GIIYSB 


VXHEU 


XSPFG 


WKZTE 


KZYIZ 


ZXFJO 


HZIVG 


ESGOX 


YYGEO 


B 












Message No. 13 








QNKIT 


FZHNC 


GJBSA 


JIQBX 


PFTAO 


RJLUD 


IKPXI 


TNUVU 


EVGHU 


LJUFZ 


UBSUD 


VMMNZ 


UWVYQ 


DJPFY 


XETMN 


CLEQS 


DQXNA 


GEYHC 


JBCPG 


RNNNH 


BSYZR 


STGBV 


E 
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MES&AG19 No. 14 



ZJRI\K 


CLYZK 


RINOK 


NBTAK 


NOaBI 


RRZRX 


DQTAR 


AEKUY 


UOXLT 


yVWFL 


DXZSK 


ZPIVNI 


JUULI 


TPUJR 


TPQGH 


RJZCQ 


XCNHU 


NOKUI 


KKGiCF 


FJWWR 


ZXROX 


PZUGG 


HMYNB 


DUHQZ 


BJDFA 


FJAYU 


RKHKB 


ZMKMI 


D 










. . 












Message No. 15 








OSULC 


WZAFff 


SQAUC 


WPRBI 


WCFUO 


JXQHY 


OIWVX 


KSMSX 


MBXOZ 


QQDCX 


CBIUT 


DGIAS 


BAQRX 


RUKHF 


JYUJG 


DYNUA- 


CWULT 


.PXffES 


UUKTF 


FOQOK 


KNRPI 


PLLXQ 


DRDEH 


JHCFB 


WLHVX' 


FIFro 


STVJV 


XDCAS 


BEKQV 


OPWEY 


UTISE 


NZFCU 


BXI 














Message No. 


. 16 








VHXVD 


RSRYt 


PVNWQ 


QEDUJ 


LTKFN 


RHDGT 


DNUBU 


JDGVP 


■JCJUND 


RRQDM 


STTAL 


GKPPO 


PZTCU 


BNCEE 


HAWSIB 


KRGNU 


ZXCWQ 


VEZTC 


DIPSG 


ZUFVH 


OZIGB 


TQXND 


HUHEW 


VHTLT 




■ 








■ 


Messag]^ No. 17 








UMMOL 


HWEQ 


GffTJI 


PQTNS 


AEFZH 


TOFQH 


OOXNQ 


NPDDT 


QVSJR 


RAABX 


ZCVWR 


UUJEG 


NHJHQ 


CHZUld 


TTSIU 


GHELW 


HUUFH 


LZHNH 


CJEOW 


AKHZA 


SDZIE 


HIHlfG 


LBUFZ 


DVPUB 


DCUXO 


KAYEY 


GQHID 




• 




Message No. IS 


• 




• 


LALNH 


QUDUA 


ZBZUD 


VSJFE 


USHXff 


EUWZT 


OKVNO 


OOSIL 


TASEG 


OVQVX 


PUKKW 


BQRBI 


VGDGJ 


JDAH# 


RZDIA 


WQAXB 


FBRLA 


AHJEP 


PUUEU 


Hjqjp 


ZSPPQ 


yzipL 


HECDA 


LPJJS 


ZOJYB 


UO 










MKH».'^c;>; No. 10 








YTVUL 


TWEVD 


UBHKV 


IHTPI 


GNXBQ 


XAUAQ 


OUFVO 


GSMKB 


BAKIG 


YRNAF 


BKIJC 


ZJSRN 


WBQHM. 


UYJPT 


CHCCB 


RUIVH 


OLDQA 


ZZDCV 


UV7MNZ. 


OPRFC. 


RDONY 


RCZAM 


ZYNVQ 


flFONZ 


CTTES. 


IRWER 


GKETA 


YUSUK 


TFECO 


BMQVB 


VBWW 


VWCZP 


TffCTJ 


mFEH 


VNDCO 


UZVLK 


YUJPZ 


BHLDY 


WPMD 


KFWB 


VCYU 
















Message ^o. 20 








OBHOX 


UWRON 


AJDFH 


FRQUI 


ULOTG 


XXIEV 


HMAKV 


PVMAV 


OITKD 


LDQIW 


UYVWI 


JEJRQ 


MCUZP 


KGUDN 


QSPBF. 


TQVZP 


IZJTU 


RBUFZ 


JUSIZ 

LVD 


DCIGX 


QESJD 


LIZVU 


AOPUE 


Y^IEXI 


HOXJK 


KUYHK 


AORUY 




• 




Message No. 21 








Or.5XOV7 


LTV/EW 


KFJHN 


ZMSKK 


GXFDL 


YLCPT 


YYYNQ 


CYYMA 


ZRRAD 


HG7/SI 


HHCGU 


FDGMN 


XIQDN 


NPLYQ 


NPJNZ 


OWFW 


KHGVH 


KHCliC 


STNVZ 


CGXHV 


LZZXX 


SVTKG 


POEAC 


OJYQU 


UEULH 


KHYDD 


ETPDN 


QY2VU 


HIMGG 


RBHEV 


CUSO 
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Mebsack Xo. 32 



SFDF:.; 


AQMQW 


FFOMY 


GVFZH 


JTPSY 


lAONR 


TWSZJ 


OJUGA 


HOGXV 


YJOUN 


AEGQG 


HTZPB 


YRNEI 


B7/IVZ 


JNPXG 


SXXSZ 


HVRUC 


QEHQR 


eXSUF 


UKCTB 


BETEX 


ZSLRX 


QFXUV 


CLRIL 


LHIGG 


FVDSQ 


RJCffl 


JJUIV 


JFCEU 


QFPHO 


TFSOQ 


XKESC 


CJCFI 


BN 












Message No. 23 








LKKG? 


KUKNY 


WJROY 


XZHPK 


JJIHU 


LUVBU 


ZRUDA 


XXHRF 


UOAFT 


XUV;VE 


YZAJE 


NFDEX 


GDZGA 


JBZET 


XYHAL 


DGECA 


CDPYU 


BTMYK 


LCSbll 


YVSffW 


DQNAP 


KFPAO 


FIQTR 


XQQIE 


HHYCA 


GHEBF 


PFTYF 


DZAUO 


W 






















Message No. 24 








PUOOK 


OKffHR 


MPXQI 


PIQMN 


ALffNX 


HZIKQ 


XQNUY. 


GQZNG 


■*dftfo 


YJODO 


XIRIX 


KDCBX 


UEQTU 


VPIWA 


NFVOH 


GWXXY 


EKBUG 


HGETD 


VFKKZ 


PXXZL 


XFRWL 


FHTEG 


INNJI 


ETVUP 


QlfTjy' 


OP 










Message No. 25 








FPETJ 


CDVNY 


LUKQU 


CDALX 


FIYGR 


HQMIP 


FAONR 


QCAVJ 


UFIAC 


YXDKR 


GWUQL 


FQMEJ 


KOBliS 


ZURAN 


ulzfe 


YDLOT 


UZMJM 


SETNP 


GWILM 


FGCVS 


NCZGB 


HUIRT 


XUWAI 


DGAVL 


QBTFK 


VYIGT 


FLUVK 


FJYAZ 


YQNVO 


WSMSS 


CClIFY 


KVXRA 


y 














Message No. 26 




' 




XFYKU 


NDBFZ 


KNDMF 


GBVIJ 


TKNDA 


LGPKS 


CWPSK 


BSNffH 


LTTXN 


VDCYO 


GYZLI 


TRURC 


GBIWL 


VEKYG 


YOOHV 


lAUNO 


LPSJJ 


UTNIE 


ZAECE 


XQOYB 


FDBPB 


SEBMA 


SRUEU 


RUHKE 


RIYIT 


BDWVG 


UZNGB 


TRJTG 


PZJCH 


LSFXW 


ANHJO 


UZHRQ 


ULXFU 


XSINC 


UyKNT 


ZWXXQ 


V'lVRIH 


MPMHG 


DURHG 


IJKIR 


UFRNA 


APKQP 


KAXUF 


CACVC 


LNICD 


ESOWP 


HJQEC 


GGJBF 


SCaiFC 


TQRGT 


tJODEE 


XHRXE 


KIOZG 


DLRZV 


DLZeS 


EHVZL 


GAMFQ 


ULGXC 


??IZWK 


IIZYY 


HTAW 


OTTTO 


RFTHL 


HQWVZ 


XZUAB 


GLHMH 


ZFTIQ 


OTJEP 


VZXCL 


YFYOE 


LSIJU 


SYBHC 


YQXGD 


SECCE 


CIJTI 


BUILZ 


ZUAPV 


QRYXB 


VHUWL 


LPXGC 


RLETI 


DJBCV 


IHSAH 


RHCUJ 


RAQBJ 


IWORY 


OISIE 


RESXE 


PAUJD 


SHQVB 


VEASF 


TZeWL 


AUHKA 


MSTZZ 


DOQYB 


RCUUS 


UXWQX 


WDPDB 


BYGMQ 


XRLCH 


lOKZO 


EPLQU 


XJZKI 


JNSSI 


KCHXX 


ZMZKV 


PVUGD 


QCVQ 



It may bo of advantogo to begin the elucidation of tho piindpleB of solution by translating 
this cipher into tenna of the sliding of primoiy alphabets against one another with the con- 
sequent pj-nduction of a multiplicity of secondary idphabets. For example, by using ordinary 
sliding alpiinbots such os arc commonly used in cryptanalysis, we may produce the same 
results Hs are given by the set of concentric disks. Let us use the alphabets of the illus- 
trative disks, mounted upon sliding strips in pairs, and let us slide each pair of alphabets 8 
letters apart. Thus,4f wo consider the upper one of each pair of alphabets in figure 2 as tho 
plain-text alphabet and begin each alphabet arbitrarily with the letter A, we have the following: 
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rPlain text AUFQZERHYGVJOIDMNCTZSBLPKVAUFQZERHYGWJOID 

^(Cipher AUFQZERHYGWJOIDMNCTXSBLPKV 

pnaia text AOXGFZMYLPUKETDJVSWIRBNHCQAOZGFZMYLPUKETD 

^[Cipher AOXGFZMYLPUKETDJVSWIRBNHCQ 

jFloin text AWJBQIHVKPUFOGTNEDSZXCHLRYAWJBQIHVKPUFOGT 

^|ciphor AWJBQIHVKPUFOGTNEDSZXCMLRY 

rFlain text ACVXZWUTFNUIOSEHJRDLGKQBPYACVXZWUTFNUIOSE 

^(Ciplier ACVXZWHTFNUIOSEH^RDLGKQBPY 

jFlun text AETJUDUVZNWHXOGYKFRLQBPCISAETJUDliVZNWHXOG 

^Cipher AETJUDUVZNWHXOGYXFBLQBPCIS 
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Xofp now- lluit tlip fiiwt set of it |pt.tci«, PREPA, yields the twine act of 5 cipher 

letters, MEKJR, thiit we found on psii^ 2 hy usiiij; the diskn. llie <nily tiling; wliieli thefic five 
pniiw of Independont slidinp; nlphnhrls have in exminion in fi|;iirc 2 is the fact iJiat each pair init; 
been slid apart the same number of letiivs, vix^, 8; if we ronsider the upper alphabet in each pair 
ns the slalionaiy alphabet, then (ho lower one h.-r-t been shifted 8 intervals to the rif'ht, or 18 
intervals to the left, of the upper aljihabec. Tltis eoirespnnds to the position of iiimibnr 1 in 
figure 1, for the latter number occupies the- eightccutli segment to tiio right of the plain segment, 
or tlie eighth to tho loft. The relative positions of the numbers in the numerical key, therefore, 
correspond to tlio numbers of intervals the priinar>' alphabets in tlie form of sliding strips would 
have to be. displaced in order to produco the same results os the disks. 

Now tho sliding against itself of a primaiy sequence containing 26 iotters will give rise to 
a scries of 25 secondary cipher alphabets; ' likewise, each primiuy concentric sequence will 
give rise to a series of 25 secondary alpbabets. If the numerical key consists of 25 numbers, 
all tlicse secondary alphabots will be employed; if it consists of less than 25 ziumbcrs, then a 
correspondingly decreased number of secondaries wrill be employed. 

Since cocli primary sequence can give rise to a set of 25 secondaries, the total number of 
possible secondniy alphabets in the whole system is 125; but if the numerical key consists of 
less than 25 numbers, then tho total mmiher of secondaries wrill be less than 125 by exact multi- 
ples of 5, since the absence of ono or nioro numbers from the key affects all live primary concen- 
tric sequences. For example, if tlie k(^ consists of 21 numbers, then there will be involved 
21X5, or 105 secondary alphabets. In a message of exactly 105 letters, then, each letter will 
be enciphered by a different secondniy’ alphabet. If die message contains more than 105 letters, 
then all tho letters after tho 105th wrill bu oneip)ierc«l hy tlie same secondary alphabets as at 
tlie beriming of ihc message and in the same sequence. 

In the explanation of the motliod of encipherment it was made dear that the substitution 
proceeds in a regular manner, taking successive groups of 5 letters; tho dpher equivalents are 
taken from tlie successive segments, proceeding in a dodewrise direction from any given initial 
segment. It follows, thoroforo, tliat in a single long message wherein the complete endpheiment 
requires tlic passing tlirough of this sequence of segments more thon one time, there eodet periodic 
or cyclic phenomena of a type found in various dpbers, due to the presence of a definite or regu- 
lar et/cle. In this cose, the Icngtli of tliia cydo in terms of groups of 5 letters eorresponds exactly 
wdth the length of tiio numerical key; its length in terms of individual letters is five times the 
length of the key. For tho sake of dority, wo shall refer to this cyde when stated in terms of 
lottors 08 tlie period. Thus, witli a key of 21 numbers, tho length of the cyde is 21 groups, and 
the Icngtli of the period is 105 letters. If a message consists of 316 letters, for example, tlie 
letters would pass through three complete cydos; tho 1st, 106th, and 21Uh letters would be 
enciphered in exactly similar positions, and therefore by exactly the some secondary alphabet. 
The 2d, 107th, and 212th letters wrould likewise be ondphered by the same secondary alphabet, 
but of couisc not tho same as the preceding secondary alphabet. With a of 23 numbers, 
the length of tlie cyde is 23 groups, the length of the period, 115 letters; the 1st,' 116th, and 
231st letters would bo endphered by the same secondary alphabet; the 2d, 117th and 232d 
lottors by a difFerent secondary, and so on. If we represent the length of the period by n, 
tlicn the 1st, (n+l)th, (2n+l)&, (3a+l)th, . . . lottors fall in the same aeoondazy alphabet; 
tlie 2d, (r»+2)th, (2n+2)tli, (3n+2)th, . . . letters fall in another aecondaiy alphabet; and 
so on. If a message be longer than the period, therefore, it will follow that tho 1st, 2d, 
3d, . . . nth secondary alphabets must contain ropetitiona of dpher lettora, representing 

* The twenty-sixth secondaiy slplisbet eofaicides with the nonul alphabet, siiiee essb plaiihtext letter 
would bo represented by itadf in that secondary alphabet. 
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r(‘p:-liiioiis nf ])I,uii-Ic'NIi IoIUts, for IIk'si*. s-.-i-oixtiir.’ iim uTUt :ill iiiily sin"lo niixccl 

I’ililiiT :iii(l Mu- ri'|<i>rilifin tiniijjli-fii'ijiioiify |.■•li'•ls in uniiinuy plniii. .'.RxL is a iifRCSSAry 

oi' aIi :il|i!).‘il:i>:ir;i] I'l'pi .i'.iuns xill bn itviilr-nmi b.v icpclitions 

in :ii<' i-ipiii'r aI n, '2n, Hii iiilnrvA.'s, nixl iiiny iii:iy bn ii»k 1 1« dnlnniiinn tlio ]c‘ii<;lli of tho 
Mxsn-ily Imw Ibis is »ioiin will i>ri’snn|ly lx* ilnmunsfrntind. 

Mill. Ilin ilnln'iiiiiniiioii nf fix* lnn,^lli of llm iinriixi is only o slight si-op forwarcl in tiin miiil^'sis. 
It- is t-rxn I-IihI ii- will pvn ns f.lio Inii^lli of llm tiiiiiinrin.:ii koy, luif. tlint is till. Wliiit wo must know 
lU'Xl is llin sncpx'TX'-o nf miinbnrs, or rntlinr, (bn rnlnlivc ]>osi(-inna of Uio miinbcni in 1-iiis koy, 

W\' may asnnrtnin this by further soniliny of the tlicorcticol one! actiml results of the method 
of nnniphnrinnnt. It is often the c»se with various ciphers that the method of onciphennent is 
nxri'llnnt in prinni])le, and will yield prnrticaU.v iiidociphnniblo inessii-j;cs when tlie mesMigcs arc 
very few in niiinbnr, but tho weaknesses in the method are quickly disclo^ when it is used for 
regular trninc such os that nocossoiy in military cryptography, wliero many messages are to be 
sent cacli day in tho same key. In tho cipher undew elimination, the weakness is introduced by 
the f.int that the initial segment for each message of tlie day's activity is dotetmined by tlie serial 
number of the message.* Now tliero are as many imtiol segments for each numerical key as .there 
are numbers in that koy. Once tlie starling point is determined, all the messages pass througli the 
same cydo; different messages merely br^n at different points in the cycle. Now, since tho 
numbers applying to these starting points constitute tlio sequence of numbers in the key, the 
successive initial segments constitute a series or sequimce which, when properly rooonstructed, 
will give us the sequence of numbers in the koy. 

After tho numerical key has been reconstructed, we are yet a long way firom solution, for we 
arc still confronted by the more complex problem of reconstructing, or solving, tho cipher 
alphabets. 

We have so far analysed tho solution of tho problem into the following three stops or jdiasK:. . 

1 . The determination of tho length of the period. 

2. Tho Toconstruotion of tho numeriral key. 

3. Tho reconstruction of the cipher alphabets. 

Lot us proceed, dieroforo, to perform oacli stop. 

1. The determination of the lei^^th of the period. — was explained above how the cipher 
system will result in tlio production of repetitions in the cipher text at definite intervals dependent 
upon tho length of the period. The first, (n+l)tli, (2n+l)tih, . . . letters fall in the some second- 
aiy alphabet; the second, (n+2)th, (2n+2)tli, . . . letters fall in another secondoxy alphabet; 
and so on. If tliere are repetitions in tlic plain text at n intervals apart, there will be correspond- 
ing repetitions in tho cipher text. Tliero would bo involved hero only a sUf^tly modified case 
of the ordinaiy process of factoring the intervals between repetitions in tho cipher text, as applied 
in the solution of typical periodio multiple-alphabet dphera. Thus, in .this case, if it happens that 
the first, second, and third letters of a message, and also the (n-f l)th, (n-|-2)th, and (n-|-3)th, 
ore the letters THE, then there must be a repetition of the initial trigraph .of the cipher tmet, 
representing THE, at a distance of n letters. But in a cipher involving so many alphabets as this 
one, the repetition of trigraphs and polygraphs would natiunUy.be rather infrequent, except in a 
very long message. 

However, the paucity of trigraphs and polygr,aphs, and even of digraphs, need not prove 
to be a great obstndo, for the repetitions of individual letters may bo used with great accimacy 
for the same purpose, viz, Uio determination of the length of tho period. The method is based 

‘ Howover, were the initisl oegmon ta detennined ix sduifl other manner, the filial resuite would be the eame, and 
the cipher could be solved by a alight modification of method. Even If the initial segmeota were subject to no 
law, &0 cipher eould still be aedved by the method hordnoftor set forth, with eame modifieationB. 
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iilioii llic I'onstriii'iion of what wo have I'aHoil a “Tiiblo of coincidence*’, wiiicli will show us 
mnthi'maiii'ally the iin>s( prohnble Icnplli of the ]M‘riiKi. Wo may ns well iiho the text of our 
series of n-.e.-sa^es to ilhistralo the ]iroorsK. 

If we assiinie the niiinerirnl key to consist of 20 numbers, then the lengtli of the period would 
he 1 00 lf( ters. I^et iis write tho lon^st message of our series — vix, mossogo 2G — in exactly siipor- 
iiiiposi'd lines eoiitaining 100 letters cneJi, and then make n count of the rccuironcos, of more 
neeiirately, (he coineidcnces,' of lettors within the individual columns tlius formed. 

Note tho ropetition of tJio letter F in tlie second column. This fact is indicated by placing 
a eh(-<-k mark in tho tabulation of coincidonocti. Wliero a letter appears three times within tlus 
sniiio eohiniu (B, in column S), throo clicck marks ore recorded, for there we have a coincidence 
between tho first and second, second and third, and first and tliird occurrences. Where a letter 
appeal's four times in tlio some column, six diock marks are recorded. Tho number of coinci- 
dojie.es fur rjudi caso corresponds to tlie number of comlnnntions of two things that can be made 
from a total of n tilings.* 

We note that on tlie assumption of a period of 100 letters there is a total of &0 coinddences. 
Now, if tlie period is really 100 letters in lengtli, then the repetitions of letters within columns 
are not mere coincidences brought about by ^ance supciimposition of idcnticol letters but are 
actual recurrences in the restricted sense of being tho resultants of the endpheiment of similnr 
plain-text letters by the same secondary alphabet. But there is no way of determining from 
this single tabulation wbetlier the assumption of a period of 100 letters is correct or not, and 
tliorefore we do not know whether the repetitions in this cose are recurrences or coincddences. 
This we con determine, however, by a compumoa of tabultOiona of coincidences made upon 
'various assumptions of lengtli of period. Theoretically, tlie correct assumption should yidd a 
liiglicr total of coinddences than tho incorrect ossumptionB, because the recurrence of high- 
frequency plain-text lettors (in English, E, T, 0, A, N, I, R, S, H, D) is to be expected, and the 
number of such causally produced repetitions should cortsinly be greater than the number 
of repetitions produced by mere dinnee in tlie superimposition.* 

Lot us proceed, tlicroforc, to make a table of coincidence for the various probable lengths of 
period, lu'st transcribing mossago2G into lines uori'csponding to hypotheses of 105, 110, 115, 120, 
and 125 loucts.* Before doing so, however, wo find it necessary to introduce a few remarks 
upon the desirability of uang a slight correction factor for this table. 

> Yic ilrtixf llic dutiuctiun betwoon rfcvn-fium ami cviMciVracM on tho grounds that tlie fonaer tens should, 
and will huiv lie UM'd to indicate repotilione of letters ia Uio ciiilior text oausally rdatod to eooli other by being 
encipiiiTiiiciits of identical plain-text lotten by identical nliiliabcte; whereas tho latter term Indioatee repetitions 
not iMOwiiilly relutod to each oilier in this manner bnt simply the result of ohnnoe. A repetition may tiioroforo bo 

is n!ei:-.-ri'iicc or si raincidnico. The proecHS of factoring in ordinary multiple-alphabet eiphen of the periodie 
tv|ic lii.rti'.'.r iis purpose i lie Hcparaticni and eliiMificatiim of rciietitlons into tlie two kinds. Until proved otherwise, 
nil n-jii lit inns must be c(mHidcri!iI coincidences. 

-Till- fiiriuiila is: C— n(m— l)/2. Thus, wlicii mbS, the number of eoinoldonoos Is 10; when nag, tho 
liuiiilii'i' is 1.1. 

• Si nee this p:i]s-.r was written, a further study of the concept of ooineidenoes has nuulo it posciblo to prediet, 
with r.-iir •Icgrce of iieciinicy, just how many coineldcnous should bo expected for eorroot and Incorroet ossump- 
1 ioiiii. Th- ' iii:il hcnintical and statistical anolysca arc given in detail in W. F. Friedman, Analyns tf a Meehanieo- 
Elrelr''eal C.'ujtlonraph, Section VI; 8. Kiillback, Slatittieal Uethedt wi Cryptmialyna, Section VII (Technical 
Pub'i.Miii.:.:, 3. 1. S., 1034). It results from these mathcniaucal studieethat the ratio of the number of actual 
coinui.1 ine.'s to the total number of possible coincidences is .038 for on incorrect case and .066 for a correct one. 
This 'ki. iwiedce climiuates tho necessity for tabulations emresponding to every possible cose and gives a rdiable 
means of determining the conect assumption os soon os it is mode. (See Notes 1 and 8 on pages 18 and 14 
respectively.) 

* The .licssage need not be written out more tlion once if long strips of crose-ieetton' piper ore used, writing 
a Unu Oil Ci.cli strip. Each line should oontain 136 letters, and the vaiions strips earn tiiaa be aiiaaged to bring 
the pFopce letters into superinoMrition aoeording to each hypothesis In tom. 
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For n'lilly nrr.iimlfl coinpiuiiion, tho tnUils of eoineidenoo obtained for tho various liypothosns 
fihoiild hfl I'orrocti'd in order to make proper iiltowaneo for tho dilfemncra in totals duo eolcly 
to tli.> variation in llio mimber of Icitera in each column wlion troiiacribod acconling to cudi 
hypo^lvr^i I'rom a cij'ptogrnpbie point of vienr, a total of 100 coincidences in an arransemnnt 
where there arc 6 letters in each column represente a slightly groater degree of coincidence tlion 
in An arrangement of the same massage also yielding 100 coiniddenceB, where there are 7 letters 
in most of the cclnichs; there ia leas opportunity for c o inddenoea to be produced in the former 
case. We should, therefore, reduce all the totals of coinddenca to some common basis. The 
reasocing v.-p l^va followed in the establishment of a correction factor to be applied is as follows: 
\lossas<^ 28 contains exactly 539 letters. When transcribed into lines of 100, 105, . . . , 125 
letters, the columns in each of these fire ret*ups have the following number of lettan: 

I 



Tabu I PerSiyJ 

Otticn) 

80 et'lusnns of G Ictten and 61 eolnmiia of C letters 100 

14 oc»luii.uE it -2 Isttcn and 01 colunma of 8 loiters ... ... ................ 1C8 

09 colai!'.ns of 5 letlcm and 11 columns of 4 loiters 110 

TO eulvni'iii. '.<r IcKiira and 36 ooluiniui of 4 letters ....... IIS 

80 oi 5 letters and 61 edumns of 4 lettvni ........ ^SO 

89 ccluir.ua cf -3 letters nnd M edamns of 4 loiters ....... .... 125 



Azs'j:: '.r,:; -jiiut perfect eoinddeuco con occur in each column (all letters identical), tiien iu 
a eel*, /m uf 8 letters we con have 6 X 5 / 2^15 coinddenees; in ^ column of 5 letters, 5X4/2 hb 10 
eoinciisrccs; Hud in a column of 4 letters, 4X3/2«6 coinoidenceB. 

If nciv v.'c find the total number of chauccs for emncidepces for each of tho arrangements 
given in table I, we have the foUowing: 

Tablb II 



Ttrioil 

(Micro 


Condttlon) 


Total 

ctaansH 


100 


89 epliimne of 15 dianees eadi, fil ooluinns of 10 ehaneee Midi ... 


1. 105 


105 


14 eoUinnu of IS diaueu uauh, fll uulumna of 10 eliAneM 


i, 120 


110 


99 cdumns of 10 dinncee eadi, 11 columns of 6 chanees eadk....... - - - 


l)056 

1,006 

086 


115 


70 eolumns of 10 vliuives eaeh, 86 cidumiis of 6 ebnnoes muCi . . .. . 


i 130 


60 rolumiis of 10 dinnee* eaeh, 61 oolunms *>f 6 rhAnn^ maIi 


! 125 

1 


30 ooIiiniuA nf 10 idiancM Mch, 80 auiumiik uf 5 duuiCM Mah . . 


006 





Qioosing for our boaie of eompaiison tiie hypotheeie of a period of 100 lotters, the various 
proportions of chances for coincideiioes for each of the mmiuning- f'ypirthinfe wfll oonetitute 
correction factors to be applied in eeoh case. Thqf are as fcdlowa: 



Tabus III 



Mud 

(MlarS) 


ChaaNsior 

colaililgatc 


OomoUoB 

Mar 


100 


1,106 


1.00 


105 


1,120 


. LOT 


. 110 


1,056 


LIS 


115 


1,006 


LIO 


120 


066 


1.25 


185 


006 


L82 



* See footnote 8 <u the praeeding paie. Thb eorrsetloa faetor ie wnnewssry if the number of aetual eoia- 
ddeuees ie reduced to a pereonteBi ba^ la tanas of tlw total pornttslo munbor of edneldaneas. 
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Wo arc now runtiy to establish tlio tables of coincidence for the various hypotheses. Space 
forbids the actual demonstration of the several arrongomonts of message 26 to correspond 
to the various hypothetical ki^' lengtlis — tliat shown in figure 3 is typiool of them all. We 
shall give only the final result in table IV. 

’ Tabu! IV« 



Prrl-u 

(letlrn> 


Co Induction on urh hypolhcals 


Total 


(kmetloD 

hetor 


CotTeeUd 

total 


i 100 


na m iHim ifu tfti tHum 


33 




SSlO 


105 


m mi iH! ifuiHimi iHi ttiHfu 


45 


■Ih 


4&2 


110 


ifu mi mi mi mi mi mi mi mill 


47 




58. 1 


115 


nu mi mi mt mi mi mi mi mi mi mi mi 


00 




71.4 


120 


mi mi mi ml mi mi mi 1 


SO 


1.25 


45.0 


i 125 
! 


mi mi mi mi mi mi III 


33 


1.33 


.43.6 

e * 



Thi'ro seems to be no doubt but tluit the perioil of 115 letters is correct. The cycle, there* 
fore, consists of 115-:-5=23 groups, and the niinieric-.al key contains 23 numbers. This means 
that the two final segments bear no numbers, and arc tltcrcforo blank segments. 

2. The reconstruction of the numerical key. — Having ascertained the length of the period, 
and Lhiia the length of the numerical kev, tiio next step is to reconstruct the sequence of 
numbers consiiiuting the key. As stated before, this process is made possible in this case 
bv the method of encipherment which is such that all the messages of the day’s activity go 
tnntugh exactly the sumo cycle, but the successivo messages begin at diiTerant initial pointa 
in this cycle, and lliesc points coincide with the relative poaitiona of the mimbora making up 
the sequence of numbera in tlie key. 

Wo do not know the alwoluto position of any numbers in the numerical key, bdt we may 
proceed first to find tlieir relative positions, regarding the key in the nature of a continuous 
cycle or chain. Later wo nuy find the absolute pasitious of the numbers in tliis cycle, i.e., we 
sliall liavo reconstructed the numerical key itself. 

Now, us stated before, all mesaages proceed tlirough the some cycle; it u only the initial 
puiiiis for the messages which are difforont. Hence, if wo can detemiino the relative positions 
iu wluc-ii messages 1, 2, and 3 sliould bo supciimposod in order to make oil Hiroe messages 
coinclvio us ivgords the portion of the cycle through wliich tliey pass simultaneously, we shall 
tlius have detenuined the relative positions of tlio numbers 1, 2, and 3 in ihe cycle. For 
example, if we should find that the first group of message 2 belongs under the twelfth group 
of message 1, and tlie fint group of message 3 under the sixth group of message 2, we would 
conelude that tlie relative positions of those numbers in the qyde ore those: 



1 23456789 10 11 12345 



1 



2 • • • • 3 



* As a verification of Mote 3, page 11 above, the iJoroomagoB of tho actual number of ooinoidcacos to tho 
total possiblo uumbor bos boon caJoulatod: 



Poriod Pscombiiii 

100 a 033 

103 040 

110 014 

115 000 

120 033 

125 — 036 
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How aro tlioso relative positions for 'messages 1, 2, and 3 to bo dotormined? Clearly, wo 
may use the same basis for this dcturmiiintion ns for that concotnod with tlio length of tho 
]ieriod- -viz, a table of eoincidences. 7\ir, when messages 1 and 2 aro coiroctly superiniposod 
wc should get si liigher degree of eoineiiieiire I>elw'ee.ii the lottora of tlifl siipcrimiios^ cohunns 
than W'hcn they are not con'cctly superimposed. Tlio reasons are tlio some os in tlie preceding 
case: The successive groups of cipher letters in one message represent endphorments by Uie 
same secondary alphabets as apply in tlie other message; lienee, nqietitions of plain-text letters 
within columns will result in repotitioiis of dpher letters within those columns. We can there- 
fore detonuine tlie correct suporimposition by cxpciimcnt and the recording of coincidences. 

Having found tliat the key contains 23 niunbers, it is obvious that message 24 has the 
same scartiug pdnt in the cycle as message 1; we may proceed at once to combine them by 
direct superimpositioii. Wo do likewise witli messages 2 and 25, and 3 and 26. The purpose 
of this stop is merely to afford greater accuracy through the increased number of letters with 
wliich we shall have to deal in finding tlio corract relative superimposition of these three sots 
of messages. 

Since message 26 contains the greatest amount of text, we may regard it*, as our base and 
tiy to find the relative position of messages 1 and 24 with respect to it. We now place messages 
1 and 24 beneath message 26, beginning the first group of t^ former bdieath the second group 
of tho latter. A tabulation of tlie number' of coinddenees in each column is then mode. Mes- 
sages 1 and 24 are again placed bcneatli message 26, beginning the first group of tho former 
beneath tlie third group of the latter and again the total number of coinddenees is ascertained. 
In other words, messages 1 and 24 arc moved succesdvely 1,- 2, 3 ... 22 intervals' to the right 
of mossago 20, and a table of coinddenees is constructed. Ihe greatnt total number of coin- 
cidences, as shown in table V, is given when mossogog 1 and 24 are plooid three intervals to the 
right of message 26. This means, then, tliat tho numbers 1 and 3 oocuiy theae relative pooitions: 

12 3 

3 • • 1 . 

Tabl* V* 



liitcrrula 1 2 » 4 6 G 7 8 9 10 11 12 13 14 18 16 17 IS 10 20 21 22 

Coincidences 60 40 SA 40 63 SO 55 60 69 47 66 46 61 64 68 64 40 68 60 48 62 40 



Showing vorioua totsla of coincidniico when mcncngee 26 sod 1 sad 24 On saperlmpooed st different 
intervale correeponding to the euccecaive. bypotheeec of rsbttlve position. 

Messages 2 and 25 are next taken for experiment and a siinilar table of coincidences is made 
for tho various superimpodtioiis with inosBoge 26, omitting, of course, the three-interval trial 
since tlie position corresponding to tluit test we found to be occupied by the number 1. As soon 
:is tho relative position of eodi number is found, tho subsequent trials corresponding to those 
nuinbers may bo omitted. Tlio labor is, of course, somowhat tedious but may be done by 

> 1 1 WM t Iiiiijglii II iiiicmiiNiiry to use botli incwagcs 3 and 2G os a base, since tlii; latter alone iiucijned siiSlcion tly 
I'll.,-.', !>i r.iv: rcliubli: iiiforinniiijii. 

^ All iiiiiTviil ill UiiH i;iii:n l;i f:i|iin! tn n. i'ri<ii|> i>f C InUnr.i, Iircimiims Uhi ciiiniiihiirniont |inicourls In HRiOueiils of 
.'i Iri.i iTii f.-u-.Ii. Wo iiocil try only tbe first 22 iiiicrvalK miicb the eyolc coiwiKta of only 23 pwitioiis, and incwuiges 1 
anil 'J-l uaiitiiit have the same beginning point as nicssagCH 3 and 26. 

' Yv ill) regard to tbe inatlieiuatical notinii diaeiissod in Note 8 on p. 11, tbe following remark is ixirtiaont: 
Tlie till 111 iiiiiiilicr of raiiiicldcncos ixwNililo wlicn iiiOKMgi*s 1 and 24 are placed tiirco intorvuls to tho right of 
iiiosKiiKn 2G is 12.^3. Tho iiiiiiibor of aeliiid ciiinciclcncca, SO, when divided by 1253 gives .OGS. Since tlio osiiceted 
leMi'.t for a curreot assumption is .OGG, it is at iiiieo evident that this assumption Is comet and it is consequently 
uniicci.-siiury to oonsidcr any furtlior oases. 
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dorks. Tills procos.s is ooiitin not) in siiiiilnr innimi'r for cl! fhe remaining messages. The data for 
inossagos 2 nnil 20 s.liow ilia t Ihoy belong fire intervals to tlie right of messages 1 and 24, and the 
relative positions of the numbers 1, 2, and 3 in the cycle arc therefore these: 

1231234C 

3*‘1***"2 

The data for thia determination of position for all messages ore given in table VL 

Tabu VI.— Data /or cb(«riHtiiaXM« of Ike potilieH the nvembert in qieb 

Mbskaob 3S Umb As a Babb 

JPoBiUon a 8 4 8 8 7 8 0 10 11 18 18 14 16 16 17 18 10 80 21 38 38 . * 

^IKumbor of colnoldonces 80 40 88 40 68 80 66 60 80 47 65 46 61 84 68 84 46 68 60 48 88 40 

JPooition a 8 6 6 7 8 0 10 11 18 18 14 16 16 17 18 10 20 JU 33 M 

'l21iimbor of coincidoooea 46 40 68 57 08 Cl 00 



^Position... 
*lKumbcr of 

.JPosition 

umber of 

.rPoaitlon... 
'*l2$umbcr of 



Position a 8 5 6 7 8 10 11 12 18 14 16 16 17 18 10 20 21 22 28 

Number of coinoideneee.. 84 85 88 36 16 33 27 23 10 24 38 24 24 81 10 36 20 42 86 82 

Position 2 8 6 6 7 8 10 11 18 18 14 16 16 17 18 10 30 23 88 

Number of ooinoldenoes 44 81 88 81 83 30 26 23 16 38 40 



JPoaitlon 3 3 6 6 7 8 10 11 13 18 16 16 17 18 10 80 22 88 

iNumbcr of coincidences..... 86 88 87 86 40 81 20 11 66 

JPosition 2 8 5 6 ■ 7 8 10 11 18 15 16 17 IS 10 20 32 38 

HNumbor of coinoideneeB 38 88 26 27 85 30 24 81 81 16 10 38 38 38 M 

JPosition 2 8 5 6 7 8 10 11 18 18 16 17 18 10 88 28 

iKumber of coinoidenim.....' 86 27 W ^ 

JPosition a 8 6 7 8 10 11 18 16 16 17 18 10 83 38 

"iNumbor of ooincidenocs 37 82 26 17 21 21 SO 17 13 21 27 68 

.JPosition 2 8 6 7 8 10.11 18 16 16 17 10 22 28 , 

^°|Number of coincidencoe 18 18 16 18 28 10 14 18 13 20 10 16 IS 63 



..fPOsition 

^ iNumbor of cot 



. , ij Position — 

iNuiuber of cbineidenccs.... 



3 3 6 7 8 10 11 18 16 16 17 10 33' 

16 10 15 14 16 16 M 17 M 20 » 10 

3 IS 16 10 

21 80 82 16 



-JPosition a 8 6 7 ' 8 10 11 16 10 17 10 23 

'iNuinbcr of eoinoidcuecs.. 18 27 66 

-JPosition 2 3 7 8 10 11 15 16 17 10 M 

INuinber of eoincideneos..... 20 10 Uf 18 18 10 16 38 

-JPosiUoii 1 a 8 7 8 10 11 15 17 10 28 

^ INumbcr of coincidences 20 17 32 80 18 20 83 38 24 

, JPosition 2 3 7 a 10 11 16 17 Iji . 

"iNumber of coinaidenees_..j. 38 82 87 37 24 60 



^Sceondaiy test, using 



1 and 84 idue 2 and 36 as the basa 



eouo-ss — 1 



I 



REF ID:A64722 



Tablis VI . — Data for determinalioH of the ptuiUon of the nutabere in Contlnuad 

MiasAaE 28 Used A* a Bau ' ; 

f Position 2 3 7 8 10 16 17 10 - ■ " : ■ ; 

^''[Nunilicr of coincidences..... 84 28 47 , : 



jyf Position 


..28 S'lO 16 17 10 


■■ 




^^INumlxr of coineidcqcM... 


.. 16 M 30 






I Position 


.. 2 8 10 16 17 10 




*1 ' 


^‘’iNunibcr of eoineidoneos... 


.. 48 81 26 ; 






j^fPosition ... 


.. 8 10 16 17 10 


* N 




**’lNumbor of coinoidonoos... 


.. 28 87 06 






^QfPosition 

\Number of colneidencM..- 


... 8 10 17 10 
.. 66 14 




■ 


fPoslilon 


.. 10 17 10 


. 




^Number of eoineidenees 


.. 26 83 ^ 






../Positioii 

^Number of eoineidenoes... 


.. 10 17 
.. 22 






_ - 1 Position 


.. 17 






\Numbcr of ooinoldeneai 


.. m • 







Wlion in any trial tho total of coinridenees for a certain position atanda out pitominently 
from tho preceding ones, aubaoquont trials for tho message coneemed are omitted. 

The final result of carrying out this work for all the messages tried against message 26 is 
that tho following cycle is established: 

1 2 8 4 5 6 7 8 0 10 11 12 18 14 16 16 17 18 18 80 21 28 28 
8 18 17 1 8 12 16 20 2 22 16 6 11 6 10 IS 88 0 21 7 4 14 10 

Tliis reconstructed oyde represents, as stated before,, the idatiye, not the absolute, posi- 
tions of the numbers in the key, because there is os yet no indication os to what number occupies 
any given segment on the base disk. Furthermore, we mut remember that there is a break 
of two intervals somewhere within this cyde, representing die two blank segments on the base 
disk. This foot may cause some difiSculty later on but we diall find a way of overcoming it. 
In the meantime, we may content oursdves with the cyde as estoblidied and proceed to an onaly^ 
sis of tho results of its reconstruction. 

Tlie immediate result is to enable us to superimpose all the messages of the day’s activity, 
09 shown in figure 5. We may begin with message 1, or with any other message, but for the 
sake of convenience in anolysu, we may os wdl transcribe them in regular order. 

The letters of each of these 116 columns bdong to a conreaponding number of secondary 
alphabets, all different, but all single, mixed, subsritution alphabets. Individual froqueney tables 
are mode, therefore, oi^ are shown in table Vll. The tables are given in groups of five, labded 
A, B, C, D, and £, corresponding to tho five primary alphabets of the system. The groups of 
alphabets are given in their proper qrdic sequence, so that each set of five alphabets is accom- 
panied by a number whidi identifies its position in the sequence of segments. Thus, we may refer 
to any of these secondary alphabets by number and letter, os for example, SB, meaning tho second 
single alpliabot under segment 6. The A alphabets all apply to the outermost primary alphabet, 
or alphabet 1 ; the B alphabets to pzirruuy alphabet 2, and so on. We are now ready to attempt an 
analysis of the dpher text with the object of solving these secondary alphabets and reconstructing 
the primaries. 
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“ I 

The first thought that comes to'ono is that theso indmdual mixed alphabets may be solved i 

upon the b:isis of frequency alone, os is commonly done with such froquoncy distributions. For | 

example, wo might assume the most frequontly occurring letter in each alphabet to be tiie equiva- ! 

lent of plain-text letter E, the next, of T, and so on; tlien substitute tlie assumed values in the text 
and.tiy to build up words. But each of these olpliabets contains only an average of 36 lettoia, so ! 

tliat hardly any assumption would cony a considerable degree of certainty. This is especially I 

the case in Englisli text whero the letter E does not always stand out prominently as the most ; 

frequently used letter in small amounts of text. Were an analysis of this kind absolutriynocossaiy i 

to solution, it is doubtful whether this particular set of messages could be solved except after a 
long period of patient labor. But it will bo sliown now that such on analysis is in fact not esson- | 

tial, because we may be able to effect a direct reconstruction of the five primary alphabets, which 
will'not only lead to the solution of sll these mesBages, hut will also give us every one of the posidUe \ 

125 secondary alphabets of the entire qystem. , 

Tabus YU [ 
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Tabu VII — Continued 
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Tho method which we am about to demoniiirato ia based upon tho fact that the segments 
from w]ii<-h the ei]iher gruujis are taken fallow one another from a given initial point in a 
regular siiceession, uninterrupted in tliia ease extu^pt for a break of three segments representing 
the two liiiiTik segments of the key plus one blank wliicli is always present representing tho 
plain segment. To explain tlio principle of tliis method in detail, attention is directed to the 
fact that, as a result of tho system of ondphennont, tho sories of successive dpher equivalents 
for any given plain-text letter in any one of tho five primary alphabets coinddes with the 
sequence of letters in that alphabet. The scries will emndde with the complete alphabet except 
for the omission of 1, 2, 3, or more letters depending upon the number of bhmk segments. 
For example, tmm to figure 1 and note that, in alphabet 1, the sequence of letters b anning 
with A is os follows: AUFQZERHYGWJ.... 

Now if is patent that ij we place later A qf the fret primary alphabet in the plain segment, 
its series orf siueevnoe cipher equivalents coincides with the sequence qf letters succeeding' A in the 
same alphabet, ia2,UFQZERHyGWJ0IDlINC. . . . 

If we place another letter of the some primary alphabet — for example. Z — in the plain 
segment, its series of successive dpher equivalents constitutes exactly the same sequence, 
except with a difiierent initial point, viz, ERHYGWJOIDllMC.... Di other 
words tho succesdve dpher equivalents for those 2 pldn-tcxt letters come from one and the same 
cj'de or sequence. Now, tho same is true with respect to every o&or letter- of alphabet 1, and 
also of the other primary alphabets. Of course, the sequence is different for each primary 
alphabet. 

Since this cyde or sequence of letters is tlie same for all the letters of each primary alphabet, 
only tho scries of succesdve dpher equivolouts for one letter of each primary alphabet is necessary 
in order to effect a complete reconstruction of that alphabet. In other words, if we can sdeot 
with accuracy tho dpher equivalent for one and only one j^oin-toxt letter in eadx of the succesdve 
115 secondary alphabets, we con then arrange tlioso equivalents into five sequences of letters 
wliich will coinddo with tho five primary olpliabots, thus resulting in thdr reconstruction, 
lire reconstructed sequaroos will be complete except for tho omission of one or more letters 
representing the blank segments. If tho numerical key consists of 28 numbers, three letters 
will be wiindng from eodi sequence. Those letters will be known, of course, but their rdative 
podtions in the omitted section will have to be formd lattw. 

Obviously, the letter which will lend itsdf best to such a procedure is E, for it is the most 
frequently occurring letter in English text. If, therefore, by a careful study of the individual 
frequency tables applying to the columns of tho superimposed messages, we can select the dpher 
equivalent of only the letter E with certainty in the succesdve secondary alphabets, we diall 
at once have the sequences of letters in the five primary alphabets and the solution of the 
problem will be at hand. For example, if in a hypothetical sequence of these alphabets we 
sdcct tlie letters K, N, Q, and V, i^peotively, os the four succesdve dpher equivalents of E, 
tlicri this will mean that in priiruuy alphabet 1 there is a sequence . . . K N Q V . . ., 
providing a break in the numerical key docs not exist between tho members of the sequence 
of key iiumbotB api)lying to tho segments concomod. Continuing this process, ultinuitely tlie 
five piimaiy alphabets can be completely reconstructed. But we must remember always that 
this process is dependent upon tho correct ossumptioxu for the dpher equivalent of E in each 
of the 115 secondary alphabets, or columns of dpher text. 

Let us attempt such a reconstruction. Turning to the series of secondary alphabets given 
in t.tblc VII, wo try to find in each alphabet tho letter which undoubtedly represents plain-text 
letter E. At the very start wo encounter difilcultios. In alphabet lA, the letters H and Y are 
of equal froquenqy. There is no way of telling which letter represente E, so that we dioU have 
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to consiilor Imfli M and Y as possibilities. In alphabet again wo have difGcuItios, for both 
J and Q ha\'o ihc saino frequency. It bqginii to look liko a very doubtful pracodura. As wo 
go further dong, tho diOucuIties in selecting the represontativo of E incroaso rather than diMsceasc 
and the cryptanalyst becomes lost in a miiUitilicity of possibilities. Evidently litis method, 
as tlio preceding one, while theoretically con'uct, is practically out of the quostlcm because 
of tho limited size of each frequency table. In fact, it is doubtful whether we can select the 
roprosentativo of E with certainty in any one of the A alphabets,' and certainly, if we cannot 
do this with tlio letter tliat theoretically occurs the most frequently, we cannot do it with 
any other letter. 

It W'as at this point, when apparently a blank wall confronted tho writer, and there seemed 
little hope of solution, tliat lie evolved tho mctliod which finally resulted in solution, and which 
embodied such new principles that ho was led to describe tiieni in this paper. This method 
hod recourse to some simple mathematics, easy of comprehension and application when the 
underlying principles have been grasped. 

First,- let us make what wa have termed a “coDBoiidated frequency" table for all of the 
secondary alphabets applying to tho first, or A, primary alphabet. This is done by colleeting 
the data contained in the individual frequency tables ^own in table VII into one largo table, 
taking only the data applying to tlie lottors of primary alphabet 1. This larger table is shown 
below (tabl e 

* It vaa fonnd lator that tiie eiphor equivalent ot E has tiio greatest frequency in only 8 out of the 28 
alphabeta. In one alphabet E did not occur at all, and in dz cases it occurred only two times. It will bo of 
interest to the reader to study these tables for the Infotination they contain with regard to tho eoctrama riegrefo 
of variation from the nomal that small frequency tables oan exhibit. 
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Tliis coiisii]i(lfltP(l rmpipiicy tnblo is of :i mtlipr {tcciiliar nature. ’Each cohium tlio 
frpi[iu':u’v of (in' ciplior It'Ctcrtt in n porficulnr iiiid tlioro aro 23 hucIi coluinus, corre- 

Kpumiiii'r to tiio 23 si'i^monls of ilic miirK'Wciil kny. Tlio iiuiiiborR of tlio columns are tlclonjimpd 
by, aiul ooinciclc with, tlio sequence of nuiiihiM's in tlic c^clo ns g^ven on page IG, viz, 1, 8, 12, IG, 
etc. , Each row gives the frequency of n particular cipher letter in the successive segments and 
since the columns succeed one anotlicr in tlic cyclic sequence, it follows that the frequendcs 
in tbe successive segments on a lino with any given cipher letter form a definite segvenee oj 
Jnquencici. Tlicre being 26 dpher letters, there are 26 such rows or sequences of frequencies. 
The toml frequency for encli dpher letter is given in the column labeled os such and the 
average frequency for all cipher letters is then found to be 841-»-26=a32.4 occurrences. The 
numboi* of diTFercut st^ments in which tlic cipher letter applying to any given line occurs is 
indicated in the next column; and tlio average frequent per segment for each dpher letter 
is given in the last cohmm. 

Before w'O can proceed it will be od\'isable to estabUsh certain prindjilcs which will enable 
us to follow die subsequent reasoning more easily. We shall make use of olpriabet 1 shown 
in tablo VllI, calling attention to die fact th.'it the same prindples apply to the other four 
primoiy alphabets. In order to moke die iliiistmtion comparable in oil its details with the 
real situation in the test problem, lot us make the numerical k^ 23 numbers in length by 
adding numbers 22 and 23 at the end of the key shown in figure 1. 

Lot us see what suecoadva phun-text letters die dpher letters A, B, and C represent in the 
sequence of segments. . ‘ 

FtacRE 0. — Ptai-A text 

A— V XPLBSXTCNUDIOJWGYHREZQ 
B-S XTCNMDIOJWGYHREZQPUAVK 
O—H MOIOJWGYHREZQFUAVKPLBS 

It will bo noted that die successive pliiui-Uvct letters which dpher letton A, B, and C repre- 
sent conslituto almost exaedy the same sequence in die tlizeo lines. This follows fi-om the 
nature of the dpher system itsdf, and the cause of it has already boon pointed put. In the B 
lino dicre is a section not present in the A lino, consisting of tho letters FUA; in the A line, the 
section not present in tho C line consists of the lottcm XTC; and in die C lino, tho section not 
present in tho B line consists of tho letters PLE. This is due to tho intorrupdon in the numerical 
key; tiro secdon omitted will consist of 3 sequent letters in each cose, but those letters will be 
d ifferent for every dpher letter. 

Let us now accompany the sequence of the plcin-text letters oppodte each of the letters 
A t B, aud C, with a sequence of frequencies corresponding to their nornud theoretical ftoquen- 
oies ^ for English text. 

^ These tlicorctieol frequenoies are given by liitt on the boiie of 200 letten of plain tart. Bee Hitt, Parker. 
ilfaRual for tho Solution of Military dphero, 1018, p. G. 
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I7«uio7siiiu«aoi9tt3 tii 3 lUtttaa 

KPLB8Z70HaCSXO7WaTEBB2Q 



ZTOVKDZOJWGTKBBSarVAT 



^ g g g g g g 
g " g ^ g g 



5 § g g g 
g g g 
5 g 



S g g *a =s 



NMD Z O TWO -T KB 



g 5 5 



i i 



g g g 
g g g 



ZaxrVAVEPDBB 



s g 5 g 



Now, nnoe the eequenoee of plain-toxt letten repieaeated by these sequenoee of frequeodes 
die the some, it follows that we eon so arrange the latter as to make the sucoeasive individual 
frequencies ooinoide; and if we xnake due allowanoe for the break in the sequences caused by 
the omitted sections of 8 letten, the three sequences should ooinoide eaaotly. Thus: 
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!VB:PLBSZTCKM3>I0J'W0Y]£&&ZQ[ M* 1VSPLB8ZT0B’ 

p=5:§j$| jl|i| j| I ji| 
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■g g^g*^5gg ggg g g g^g 

1$ g g ^^g g 5 g § 



*K.VB X O 7 W a 7 a Xt X Z d 7 xr A V K 7 L B 8 [ M IB 

ggggg-^^^ggg igg^5=^g5g I I 
g-^gg ''ggg -'-g ill 



In order to make tlie Bequencea ooiiicido, we displaced ihe B sequonoe five intervala to 
the light of the A sequence, and the C sequence four intervals to the right of the B sequetusc. 
Let us reverse the order of those letters, A, B, and C, and space them in accordance with the 
number of intervals which each sequence of frcquoncieB bos been diifted rdative to the others. 
Thus: 

4iai«s4sai 

C...B....A 

BofOTnow to the illustrative cipher alphabet in figure 1 and note that this corresponds to the 
order of these letters A, B, and G in thu piiniaty alphabet. We have determined the order of 
these letters in our alphabet merely by coiTOctly supciimposing or shifting the three sequences of 
frequencies relative to one another so as to make tiio individual frequencies coindde. 

Now had wo not known what letters tlioso individiud frequencies in each sequence of he- 
quoncics repi'csontcd but had merely boon given the sequence of frequencies tliemsdves, it would 
still Imvo been just os cusy to find tlie correct relative positions of the three sequences from a 
comparison of Uio positions of high and low points in each ssguenee iff fretpundea. Xu other 
words, noe do vai need to know vAat letters the iiidividtialfrequcneies in eatih sequence ef frequencies 
represent ) it is still possible to dotonuino tlxo I'clativo positions (in the primary alidiabot) of the 
lotlprs :ip])Iying to each sequence in tiio dplicr dpluibot Iry a study of the -positions of the high and 
low points -In eacJi sequence of frequencies. No analysis wliutevor of tlie individual froquondos 
is ucccssoi-y, the entire frequency table bdng treated os an ordinary statistical curve. This, in 
its final analysis, is the moaning of the propodtion stated in the opening paiagraidi of this 
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paper.’ It tlius follo^vs tliat tlio five alpliabcta of ourproUem may bo reconstructed, wilihout 
a knowledge of whnt letter any individual frequency in tlxo sequences of frequencies (os ^wn in 
table Vlll) represents, by an analysis of tlicao frequency tables considor^ as true statistical 
curves. 

Let us return now to the tost messages. Table Vm represents a set of 26 sequences of fre- 
qiicncies similar in oiigiu to those for A, B, and C in the fllustrative alphabet above. Wo could 
superim])osc these sequences in tlie same niiumor and as easily os we did in the messages them- 
selves wore it not for two drcumstancos: l^lrst, wo know that there is an interruption of three 
blanks in tlu) cycle wliich we liavo reconstructed but do not know whore those blanks must be 
insort od. Consequently, some allowance must be mode for the blank segments in each sequence 
of fivtiuoiicies. Secondly, tlie individual frequoiicios in each sequence of froquenciee in our prob- 
lem do not exactly correspond to tlie theoretical frequencies of the plain-text letters to whidi 
they apply but only correspond approximately to the theoretical. In some cases this approxima- 
tion is far from close because of the paucity of text, and this will make the determination of 
tlie correct relative positions of two sequences a much more di£Beult process than was the 
case witli tlie illustrative sequences above. 

We ore, therefore, confronted with tlic problem of superimposing the sequences of firequendes 
correctly witliout a knowledge of these two factors, and tliis we shall accomplish by a dight modi- 
fication of method and a recoiuse to some simple mathematics. 

'First, as to the modification of metliod duo to our ignorance of the exact location of the 
break of tlirce intervals in the numerical key: this consists in superimposing sequences, not to 
find tlie relative positions of any pair q/' aegueneea but to find such sequences as are on« and only 
one intereal apart; i.e., sequences which represent a relative displacement of only one interval. 
The reason for this step is now to be explained. 

Lot us consider the sequence of theoretical frequencieB cotrespondmg to the cipher letter A 
and the letter which immediatdy follows it in the illustrative alphabet, vis, 17, arranging the two 
sequences as though we had only reconstructed the cycle and hod not as yet determined the 
numerical kqr« Let us begin both sequences with segment 0, the first segment in the key. 

‘ The ordinary frequency table applying to a plain text or a eipber alphabet does not eoriecpond to the 
ordinary frequeney distribution of statistical work. In tlie latter, the position of the points along one of the axes 
of tho graph and their extension along tho other axis are either oausally related, or the curve treats of data whirii, 
• being subject to the operation of the laws of probability, form the normal, or Quotelet, curve of error. In the 
former, the positions and extensions of the coordinates are not related In any way unleu one considers the arbi- 
trary order of the letters of the alphabet as constituting a cauu. The positions of the ooozdinatu in a crypto- 
graphic curve u'oro determined many oeuturics ago when the English language was lint evolved. 

But tho sequences of frequencies in table VIII are not similar In origin to the ordinary plain-text or cipher 
alphabet frequency tablu of cryptogropliic work. Tliey are. In fact, cloBely related to certain frequency distribu- 
tions of slatiiitical data because ^ positiou and cxtcrisiuiis of tlie cooedinates ore absolutely doterminod by a cause 
other than the arbitrary order of tho lotton of the English aljihabot. Then two ehamcteristlcs of the curves of 
a series of secondary alphabets may bo varied at will by clionging the sequence of lotten in the primary alphabet. 
Any act of frequency dlstributioiia applying to a scries of secondary alphabets derived from a varied primarj* 
alphabet may be treated in tho same mathematical maimer os then will bo treated in the subeequent pagu. 
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It is evident that we mi^ superimpose tliose sequences correctly by shifting the A sequence 
one space to the right of the U sequence or the U sequence one space to the left of the A sequence. 
Thus: 

FioTmx 10 

A 

9 17 i 14 10 7 31 II 10 0 30 U 11 3 3 11 3 1 13 IS 3 31 a ' 0 17 4 14 U 7 tt 11 U 

VXPX<B8XTO]rXDZOJWaYXBBZO[ M IVXPLBB. , .■ 

= j j j I '=**'§1^1 



'0 17 4 14U7ttllU 
]V X P I. B B . , .. 



4 14 10 
X P I. 

=5 ^ =5 

s: 



11 10 0 a 19 U 3 5 10 3 1 13 U 8 a a 0 17 4 14 M 7 31 11 a 

XTCXXDX07WOYBBBZ[ M lAVXPBB. B.. 



ggg 



ggg 

ggg 



H g 5a 






Noll' now Mini. coiii|)lol.rt niid pcrfoct coini'idonco between sticccsrivo pairs of supcrimixisod 
sogiuenU is oblninoil Axcojit in two casca, vi/., Ilioso uivolving the letters A and Q in tlio U and A 
lines, n'speeliwly. This is due to tlio fact that tlio brook in the sequence comes between sej^ 
ments 23 and 0. 'flio frequency of letter A of tlio U se<]ucncc siiould bo matched witli the frn< 
qncncy of A in tlio A sequence, but tlie latter docs not occur booauso of the break in the numerical 
key. The same is the case with the letter Q of the A sequence, 
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Now suppose lihat wo did not know whore tlio break in the numetieal foils, and lot us 
suprrimposo the sequences again. Thus: 



Kiocbs 11 



B 17 4 14 10 7 a II u a » u u I • 
VKPI.BBZTOKMDX O JT 




Ul lUUaBX0 17 4 14 10T»llM 
WOTKBBSOVZPXiBS. . . 







■ff 



0 17 4 U 10 7 U 

A V K P X. B 8 




iiMsaiBua sus i »isa»XBi7 4 uio7aiiuo 
ZTaxrXDXOXWOYKBBBAVKB-X.BB . . . 












It is seen that perfect coincidence is Still maintained throughout except in the case of the one 
pair of segments containing the letters A and Q of the U and A lines, respectivdy. By omitting 
the three blank segments representing the place where the break occurs, we have brouf^t the 
letters A and Q into an inconect superimpoaition. But the amount of error due to the super- 
imposition of one pair of incorzeot segments os ogoinst the coireot superimpoaition of 22 other 
pairs is of so Uttlo consequence that it may be ni^i^ted altogether. In other words, when we 
superimpose sequences iMeh are only one inierval apart, relative to each other, we nuiy neglect 
the discrepancy that would be due to our ignoriuico of where the break in the numerical key 
comes. 

Now suppose that wo did not know thsit the letter U immediately follows A in this illustra- 
tive primary alphabet, and had only a table oootaining the bequenoios applying to the cipher 
lettors (similar to those shown in table Vm). It is evident that by placing the A sequence one 
interval to the right of all other sequences succossivoly, and ohoosing that sequence which most 
closely coincides with the A sequence in the positions ^ the high and low points, we ihall thus 
have determined what letter immediately follows A in the primary alphabet; this foIlowB because 
the letter applying to that sequence of frequencies occupies a definite position in the cipher 
alphabet, viz, it follows A. Li this case, the U sequence would bo chosen and we would conclude 
that tlio sequence in the primary alphabot is ... A U ... . Taking tlie U sequence, tire same 
operation is perfeomed with the otlior soquoncos ns before, and wo thus find the letter tlurt 
follows U in tlio primary alphabet. Thoorerically, therefore, wo should bo able to reconstruct 
the complete primary dpliabet by this method, and tlius overcome the difficulty duo to om: 
failure to know exactly where tlio interruption in tlio numerical key falls. 

In this process of matching sequences of frequencies to decide wliidi one most closely 
coiiiciiles vdtli a given sequence, we oomiot depend upon a mere ocular examination and com- 
parison. Wo must reduce the operation to a matlrematical method. This, wo shall proceed 
to consider. 

Let us return to table VUI and sdeot that lino for experiment which gives the best indica- 
tions of representing the closest approximation to a theoretical fipeqaency table containing as 
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tow elements ns nro ciiiilnincfl in nverngo lino in Uio tiiblo. In a thoorotioal frequency table 
of sranll sfin) such as the one shown in figure 12 only the liigh-iroqueney letters ere represented; 
the low-frequcncy letters are absent. The average frequency per letter that does occur is 
evidently the total frequency, via, 84, divided by the number of different letters that go to 
make up this total, via, 12. This quotient is 34-i-12as2.83. 

t 

FiauRS IS 

AB0SX7OKX 7SIiSIirOPaBIT1TVWXTS 

i '"‘I = § " 



Total froqiicnc!3rBS4 
Numlior of different kito»<»lS 
34 

Avenge fraqueney*"— ■=•2.83 

Note now that the Y lino of frequencies in table Vllt averages 3.00 oceurrenees per eegment; 
that is, the average froquonqy per plain-text letter whidi Y represents is 3.00. This is even a 
little bettor than the average theoretical frequency per letter as determined above. Let us eonp 
sider the Y sequence of froquendos, therefore, os ropresenting the closest approximation to a 
theoretical fiequenoy table of a similar total of oceuironces. 

Following the method discussed above, let us see if we can find the letter in alphabet 1 
which follows Y. 

Taking our Y sequence of frequencies, lot us apply it to all the other sequences, placing the Y 
sequence one interval to the right of the other sequences. Thus, with A the Y sequences are 
placed in these rdative positions: 

Rgubb 13 
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B 
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B 
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We slxall now proceed to find an abstract number such as will indicate the degree to which 
those two sequences of frequencies agree, or fit, when placed with reference to each other as in 
%ure 13. It is evident that when we strike the letter which really follows Y in alphabet 1, Ihe 
corresponding sequence of frequences concerned diould give the best fit with the Y sequence 
and thus produce the greatest degree of coinddence. 

Let us now compare the two superimposed sequences above, segment by segment. In the 
upper one of the first pairof superimposed segments there are 2 occurrences of A; in the lower one, 
5 occurrences of Y. The first pair of segments agree, therefore, in 2 occurrences; Le., there are 2 
ooinddcncos. In the next pair of segments, an occurrence of 1 in the Y sequence is matehed by 
an occurrence of 1 in the A sequence; i.e., there is 1 eoinddenoe. Let us go through the rest of 
the segments in the same manner, ^nie results are given in figure 14. 
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Wo liavc II tciliil of 12 coinoidciicoK. lint wo imint also toko into consideration tlio number 
of iioncoiiioidcnoofl boLwoon tlio two soquonoi^; for tbeso, ns can easily bo domonstrntrd, arc of 
equal importnnro with tlio coinddcncos. In Uio two liypoUioticnl sequences given below, both 
with the samo total frequency, tlio nunibcr of coincidences is very liigli, viz, 28, yot tlio two 
sequences do not agree dosoly at all, for tliero are 45 noncoincidencos. 
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Let us find the total of noncoinoidouccs, therefore, between the Y and the A sequences. 
In the first pair of segments there ore 3 noncoincidenoes; in the second pair, none; in the third 
pair 1, etc. Lot us now add these to our table, and indude also thb niimbw of occurrences in 
the segments, for wo sliidl have need of this information very soon. 
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We find that the total of coinddonccs is 12, that of noncoinddences, 34. The difference 
is 12—34 =—22. Were the sequences in closer agreement, this difference would bo a 
positive quantity; but as a rule, we sholl find it to be a negative quantity in our work because 
of the fact that the froquondes are rolativdy low throughout. In tliis case, then, the number 
of noncoinddences is 22 greater than the munbor of coinoidenoeB. This difference between 
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tlio tolak of coincidences and noncoineidoners will bo usod ns the bans for the detonnination 
of the tn which two scqiic-ncps coincido, nnd inasmuch as wo d)all have a ureut many 

s(ic}i ill compute, a short f.iit to their clctciiiiinntion will bo of uso. If wo subtract 

the to^-il of oi-cinrcnccs from three times tlio total of coincidonccs, wo can find this diffenmee 
(lireclly wilhnut Imvin" to count up tlio nniiilier of noncoineideneos. Thus, in tliis oaso, 
(3X 12)— uS -- —22. In all subsequent detcrmiiiiilions wo shall uso this motbod. 

Now it is obvious that tlio iiiimhcr of coincidences ns well os tho number of nonooineidonoos 
is not only u function of tlio distribution of tlio occurrences in each sequence of froquoneiea but 
also of the total nnmbor of ocourrcnccs. It is patent tlmt in ono pair of sequences with a 
grvntrr total number of occuiTonces than in .‘inother pair, the totals of coinddances and non- 
ooir.eidonces might be greater in tho former than in tlie latter from tlie mere fact that there 
arc more opportimitics for coineidonce and noncoinddonce in the former case. Wo should 
therernre take into consideration the tottol number of occurrences in the two superimposed 
sequences, and tlie most logpcnl correction would be to divide tho diileronce between tho totals 
of coincidences and noncoinddencos by tho total nnmbor of oeeurroncos of all the si^gnents. 
For example, it is only reasonable to phico more rdianco upon a cose in which out* of 30 occur- 
rences tho difference between tho totals of coincidences and nonednddenees is -t-10, than 
upon a COSO in which out of 60 oocurronoes the difference between these same totals is also +10. 
In tho fonner case, the quotient obtained by dividing tlie difference, +10, by the total ocaar- 
renres, 30, is +.33; in the latter case, tho quotient obtained by dividing +10 by 00 is only 
+ .17, only half ns much. 

To, tins quotient, whidi indicates in a gciiei-nl way the "goodness of fit” of the two super- 
imposed scqurnccs, and which is obtained by dmding tho diiToraneo between the totals of 
coincidoners and noncoinddencos by the total occuironccs, we have applied tho name "Index 
of coincidence”.* It is evident that the grcalnr the index of coinddcnce, tho bettor is the 
flgn'ciiietil hetwoen tho Hiipcriinposed secpieiirr-H, nnd thus, tho closer is tho fit. Where tlio 
two .se({iiciiccs are relatively low in fm|iiciicy, tlin total of noncoinddencos mil, os a rule, bo 
greater than the total of coincidences, so that the difference will usually bo a negative quantity 
and the index will also bo negative. As tlieso negative indices approach 0, they become dosor 
to positive indices, so that when we ore dealing with negative indices, the lowest absolute index 
will indicate tlio greatest coinddoneo. Thus, on index of —.03 will indicate a much better 
fit than an iudox of —.35. 

Ketumiiig now to tho case in hand, wo found tlio difference between the totals of ooinddences 
nnd noncoinddencos to bo —22. Since a total of 58 occurrences enters into the formation of 
thoso two. tables, tlien tlie index of coinddence for tho assumption that A follows Y in alidiabet 1 
is —22 -i- 5S = —.38. 

Let us perform the same celeulations for tlio rest of the letters in table VIII Qp. 30), omitting, 
of course, Y. Tlie data are given in table IX. 



I Sitv & Kiillboek, loo. oit.,.Sootions VI and VII for other and more rellablB teifa for matohlng alphabet!. 
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Tjibls IX 
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As stated above, the best fit is obtained when the index of coincidence is greatest positive 

quantity. In none of the cases above is the indcoc of coineidenea poaitiva,‘but the value for 
P, vis, —.12, apinooehoB the nearest to a potitive quantity, and therefore represents the greatest 
dogroo of ooincidenee. The next greatest index is given by the letter Q; but inasmueh as the 
index for P is olmost tvice as groat as that for Q, we maj conclude that it is the letter P, and 
not the letter Q, whidi inunediately aueceeda T in the alphabet 1. 

We may now proceed to find the letter that follows P. The same operation are performed 
with the letter P as with the letter Y‘, tins time using the frequency of P as the base and 
trying it one interval removed from the frequencies of all letters except Y and P, for, as the peti- 
tion of each letter is determined, it can bo automatically omitted from the succeeding calculations. 
The data axe os follows: 
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It is seen that the index for letter C, vis, —.20, represents the greatest degree of ooincidenee. 
But the index for A is —.25 and that for R is —.30. In other words, the index for' C is only 
.05 greater than that for A, and .10 greater than that for R. The question then arises: Is a 
difference of .05 or .10 in favor of C over A and R, respectivdy, a significant difference? In 

> T]:o method which Isos been given hero for dotorniiniug the index of coincidence fa not very effective when 
the aJphfilictA are very much different in eiie. The inatbematfoai etusly mentioned in Note 3 on page XI has 
provided a method which fa effective In each oaaoe, and tiitb its help a great sleol of work eouU have been laved 
in the following calculationa. For, oe soon as the proper pontion of the P eequenee of fxeqaenefae with leepeet 
to the Y doquenoe bod been determined, the two oould have been oomUned ao oa to yfald a boae of twice oa many 
letters. Similarly, the knowledge that P fa folkm-ed by C would permit an adjnnetlon of faequeneiee 

to the base. Since the accuracy obtained in matching Inereaeee with the number of lettere involved in the teet^ 
the further rosulta oould have been obtained with mueh fan dUBanlty. 
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oMtor Avunls, miirhi iint (lie Ivltc'r A or R foIloi\' P, insti'acl of C? The answer may be found 
by modifying the inctluKl in one ]iarticulor. Wo have been aujwriniposiiig Kix]iiuuc(ts with a 
reliitive ilisiilnremeiil of but one iiitorval. If now wo siiperimposo scqiioncos willi a n^]litivo 
displacoincnl of two intcrvnlH, tho error, duo to llio fnilure to toko into account tlio break in tho 
numoriciil key, will be greater than it is with a rolaUvo displacomont of ono interval, for now 
there will bo two incorroct-iiairs of suporiinposod letters, but still tho results will bo significant. 
Lot us, thcreforo, tost out all tho lottors whiuli arc loss than twico the index of C, with tho Y 
soquonco reinovod hoo vnUndU. Thus with A, tho soquonces aro in this position: 
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Tlio data for the letters which may possibly follow P, when tested with ^e Y sequence 
at two intervals removed, are as follows: 

Tabus XT 
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Those calculations show conduavdy that C follows P in alphabot 1. In the tables diowiug 
tho calculations for the reconstruction of alphabet 1, whonovor tho first calculation, using 
ono-intcrval data, fmls to show a letter whoso index of ooinoidenoo is at least twice as great 
as its nearest rival, a secondary calculation will bo made using two-interval data. In two 
eases, viz, for tho lottera following K and Z, it will be noted that tiiroo-interval data were 
employed to dotormino the cozxect letter subsoquont to on incondusive secondary ealoulation. 
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Tlie tnblcs ootttainiiif' tlie rest of tho data for the rcconatruotion of alphabet 1 are given below: 



T. 1 BI.K XII . — Data far reemulruetion iff primary dptuM No. 1 
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Tabli XII^ — Data for reeotutnteUoa of primary atphabel No. CSontinuod 
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Tlio iioir completed alphabet 1, ivhidi is \rrittcu tontatirely on a dialt and mounted upon 
tlio base, is as follows: 



1 3 1 i a 0 7 8 » 10 i: U 13 14 IS U 17 U 19 30 31 SI 31 M Si 36 

Alphabet 1 Y PCOUGAKDVXRSHVBINQUFZEJLT 

This reconstruction is, so far, purely the vc>.ult of hypothesis. MoreoYor, we do not as yet 
know where the break in the iiuiiicricul Ite^' rnincs, a Jid this wo shall proceed now to ascertain. 
There are acveral mctliods. We might, for cxo-n.ple, assume that the fli-st segment to the right 
of tlio plain segment boars the number 1 of the utimeiicol key. Tlicn, turniug to tlio secoiidory 
alphabet applying to tliat segment in table VI IT (tlie bust column), wo note what the plain>tcxt 
equivalents for the letters A, B, C, . . . Z, wouui bo on Ihis hypothesis by actual trial with tlii« 
alphabet mounted on the base disk, comparing tlic frequencies given with tlieir normal e:<- 
poctancy. Tims, tlio lottora D, M, and Y — ^witli h’ec^ucncics of 4, 5, and 5, respectively— would 
represent tlio letters K, Q, and T, respectively; tiiis would be for. iOram a good agreement with 
expectant^. Hence, wo would conclude that the first segment of our base disk does not bear 
the number 1. Let us assume that the second number of our cycle applies to tlie first segment 
of our base disk, and proceed again to note the plaiu-cext lottois cotTesponding to this assump- 
tion. The cipher letters with tlioir froqucncii's and ocpiivalents would be as follows: 



I'lUbllK IS 



Cipher ABCDEFGHIJKLMNOPQRSTUVWXYZ 

i'iaiii text-.- GWPKZMUSBEAJQICYNXRLODHVTF 



Frequency ^ 



Thu agi'cement of tlioso frequencies with tliuir uonnal expectancy is fair, although not 
sinking. Thci'C are too luniiy occurrences of 1ow-tnaiiic:icy lottora, G, P, K, and Q, and oltliough 
the friMpicncies for S, 0, 1, N, and R arc excellent, there are not enough occurrouces of tiio high 
fi'e.(;iii:ncy letters, T, A, and S. ' Wo must give tliia hyiidthcsis further sci-utiny. If the first 
segmoiit beiu-s the nuiiibcr S, then the second s<^mcnt vrould boar the number 12. Again let 
us match tlie frequencies given with their coiTespondii^ plain-text letters on this hypothec. 



I'lGCKE 10 



Cipher ABCDEFGHIJKLMNOPQRST.UVWXYZ., 

Plain text-.- UHYAFQORWZGENBPTIVXJCX-SDLU 



Frequent^ '' $ 



This assortment of letters is also fair, but still the evidence is not condusivB. We might 
continue oloiig these lines, attempting to establish definitely that the numerical key begins 
with tlie nmiibcr S, or does not. But let us tiy another method. 

Beferriug now to the Y sequence of frequencies in table VIII stall retain^ the assumption 
that the mimciical key bqgins with the. number 8, the Y sequence -diould be broken and re- 
arranged in oi*dcr to conform to the assumed break in mimeriool in this manner; 
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Below these frcquuiudcB let iis place tlie plain-text letters which they Topresent upon the 
assumption that the break does occur between Uie numbers 1 and 8 in the koy. 



Fxeutus 21 




Tliia too, loolcs like a fairly good ossMirtmcnt of lugh-fraquency letters, with the single 
exception of T. It niaj' be tliat wo have redly struck the eoiToet place for the brook after all. 
Lot us corroborate it by still anotlicr method. 

Assuming the break to be os shown in figure 2d let us make a list of the cipher letters which 
should represent E in the succesaivo seguumts. Thus: 

Fienics 22 




Let us turn now to tlie respective frequencies of these cipher letters as given in table VII, 
taking the frequencies of the letters J, L, T, . . . in the A columns, successively beginning 
with segment 8. 

Fxcuhx 23 




Equlvaleots of plsiu-toxt E.J J 



Frequoncy. ....... 




iBgDQQaOl 

lar 



On the whole, this is as good as can bo expected for the smoU number of ocouirenccB in 
each table. Let us do tire some for tiro letters T, 0, and A. If the results are os good as those 
for E wo may conclude that wo Imve really found the absolute positions of the numbers in 
the numerical key: 

Figube 2t 
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Tlirsc (lisirlbutions and frcqnondos aro cortninly excoUont and we may regard our numuiral 
key as cstahlisltcd. Tlic initial segment after tlio plain is number 8. (See Addendum, p. 50.) 

conic] ]mjcccd to reconstruct iilplinbets 2, 3, 4, and 5 by exactly tlio some principles os 
vero used in tbo reconstruction of ulplialjot 1. To do so would bo purely of theoretical interest 
bcciiusc it would represent a cose where the i-oconstruction of five primary alphabets, from 
the frequencies of 115 unknown secondary alphabets, is accomplished without a preliminary 
tentative decipherment of even so mucdi as a single word. In dxort, it would represent a cose 
where the cryptanalyst, without attempting the decipherment of any part of the text, comes 
at once, after such a reconstruction as ^e above, to 1 m in the same position as the correspond- 
ents, and can decipher any message as rapidly as the legitimate reciinentB. 

Where a staff of clerks and experts is available, this method would indeed be followed, 
for tlie personnel could bo divided into five groups,, each group being assigned an alphabet to 
reconstruct. Each group could be subdivided into two seotionB, one working forward from a 
given letter, the other working backward from the same letter. After not more than 3 to 6 
hours all five alphabets will have been reconstructed in their entirely. Or perhaps it would 
be more practicable to reconstruct only throe of tlie piimoxy alphabets, say the first, third, and 
fifth, filling in the other two from the resulting decipherment. 

In the present instance, however, only alphabets 1 and 3 were reconstructed, the recon- 
struction of alphabet 3 being successfully accomplished by the application of the' same prineiplcs 
as wore used for alplmbet 1. Then a partial decipherment, in wUeh the repetitions of digraphs 
and trigraphs witliin adjacent columns ployed an important part, led to the reconstruction of 
the other three primary alphabets. 

Tlie daui for tlie reconstruction of alphabet 3 are given in table XIV. Since the location 
of tlio break in tlie numerical key was dotermined after tlie reconstruetbn of olpliabet 1, the 
columns in table XIV, upon which the reconstruction of alphabet 3 is based, are given in the 
correct numerical key o^er so that any two sequences of frequencies could be superimposed 
at any intervals. However, one-interval and two-interval data were used almost exclusively 
except in one or two doubtful instances where greater intervals were employed. 



a U h C9 
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Table XIV . — TMaJor reanutruclion primcrg alphabet no. S — Contiiiiiod 




The completdy reconstnioted alphabet 3 ia as follows: 

Alphabo-: 3.... RXLADZQFSVKOEJPQHYUUBINCTW 

With alphibeta 1 and 3 at hand, the partial dodpherment of the initial groups of a few 
messages soon loads to the complete rooonstruction of the otlier three alphabets. For example 
note what tliose two alphabets give for the beginning of message 11: 

Segments 11 5 19 

Alphabets.... 1 2 3 4 5 1 2 3 4 5 1 2 8 4-5 

Cipher. SEYBZ 1I6S0Z CUPSQ 

Haintext 0 S V T N 

1SS451S3451 

The word OBSERVATION comes to mind almost at onee. This means that we have 
detezmined the following values: 

Alphabet 2 Alphabet 4- Alphabet 5 

Sqpnent 11, 3^—E, Segment 11, Segment 11, RpnZ, 

Segment 5, A,»G, Segment 5, lp=0. Segment 5, Qp^Z, - - 
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Nctv vfllucs thus dotrnuinod are iusortod in their conoot poaitioiu with the result that 
after a short time alphabets 2, 4, and 5 are complotdy reconstructed. The five alphabets 
arc fou!ul to be as follo^v8: 

Alphabet 1 yPCOUGAKDVZRSHWBINQUFZEJLT 

' .(Upluibet 2 EQUSLPYVCJTAKHUZ.FBRXIGNWDO 

Alphabet 3 RXLADZQFSVKOEJPQHTIIUBINCTW 

Alphiibet 4.... ATEVXZYNFGSBINQRPCMDKWOJUL 
Alphabet AEFBNKLTIXUVORYWHQJUGCZPDS 

Proof of tlicir correctness may bo at oiico establisliod by dodphoting the first few groups 
of message 1. They are as follows: 

MLVXK QNXVD GIRIE lUNEE FEXVP KPVZR UKSER UVQCI etc. 

EVENT NGREP ORTSS . HOULD INCLU DEDAI LYLOS SESSE eto. 

“Evening reports should include doily losses ..." 

Once the piimoty alphabets have boon leconstructod, the solution of subsequent messages, 
written by means of them but employing a urhoUy different numerical key, is very easy; for 
the cz^'ptanalyst has only to set the fust 5 letters of any message in one segment and look for 
the plain-text beginning of the message in some other segment. Cnee the starting point has 
been found, the number corresponding to tlie serial number of tlie message con be inserted in 
its proper position, and all otlicr numbers of the key determined in a simihar maimer. 

Tliis ci])bi>r in of inteiwt also in view iu‘ it^ .-«Iriking siiiiilnrity Ui tbo Bn/icrics Disk Cipher, 
III' lim "liliir ('iiilii-r”, l|l•!ll‘nlli«ll in n pri-unn;-. Iiii|hm- ' b,v Mm niithor. 

ill till' Inlii'i' fipber, iiiiili'inl nf only priiiiiiry nlplinbeUi, thi'ro iiiv 2U to 20, iiiul iiisteml 
of liiiviiig n miiiicricol key to dclcriuinc wlieru Mi« ciphci' equivalents arc to bu taken, iio buy 
of that nature is used, or needed, since the correct line, or generatrix, is easily found because 
it is the only one that gives intolligiblo text throughout its length. The segments in the Vogd 
Ciplier correspond to the generatrices in the Star Cipher. In the former, the cipher letters are 
taken from a definite cycle of generatrices, and not all generatrices are used; in the latter, no 
such cycle obtains, for it is unuccessary, and oil 25 gonoratiices may be used at random. 

The fatal defect in tills cipher, from the point of view of its practical indooipherability, is 
tliat the segments, or generatrices, are usorl in a definite sequence or cycle, giving rise to an 
internal period within mes.sag 08 and an external period between successive messages. It was 
through them tliat the recovei? of the numerical key and the primary alphabets was possible. 

. AnoRVonu 

The method presented above for deb^rinining ilio index of coincidence will in most cn»s lend 
to the roj'i'cct fitting of two frequency distriburions. Since writing tlic foregoing descri’.rii.- • 
of tlic method for determining the indtuc of coincidence, tlie author has applied another 
wlucli, altiiough it involves odditiond calculations, will losul to even more accurate r'l-ulrs. 
Since cn-ses which may nocessitato a greater rofinemontin method than that elucidated nhoi’o wJII 
arise, it may be wortli while to present the more detoilod method. 

This method more dosoly approximates the actual methods used by statisticians and biutne- 
tricians in their anolysu of data, in that it involves the squaxee of deviations or differences 
between con-osponding observed values. 

> Sottcrai Maehint Cipktra and HiaOuda for Uioir SeluUm. Bivetbaok PubUoation No. 20. (lOlSJ 
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Let us consider tl-.c Y rii*.-;ii:';ico oi i'roquniii'u s of i.;;Wc VTil. We to find lliet distribu- 
tion wliicl), wjicn s!)ifti'd oi:c j::!ervnl to liic icit of Uio Y dL>tributiou, will most closely Approxi- 
mate tlie Y distrihulion; l.c., comes from tliu smne "parent” distribution. Retiirniog again to 
figures 13 and 14, we have the following: 

, Fiacns 25 
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Pc.arson * gives, in his discussion concerning "Testing Goodness of Fit”, the method for 
dotennining the probainlity that random sampling would lead to as large or larger deviations 
betw'con observations thon those observed. 

We shall here summarize the paper referred to. Lot the sam^ea be given by the frequeneiea 
in the same class as follows: 

First sample: Ju Ja, fa, ... N 

Second sample: /f, Ji, fi, ... /•< N* 

where the totals N and N' differ as little or as widely os we please. The value x* 9ven 
1 *' fiV'/#— WD* 

X*" ^ — Jt+f ' ^ calculated. Tables’liave been calculated which give, for various 

values of n', the probability of obtaining a value of x* as big or bigger than the one calculated. 
In matching alphabets wo must be careful to choose the proper value of n* in the tables. It 
is known that not all 26 letters will appear in a monoalphabot until the total .munber of letters is 
about 300. The munber of different letters corresponding to various total lettera per monoalphabet 
con be determined, and a curve plotted that will enable one to obtain this value readily.* The 
procedure in applying the method for matching is as follows: 

1. The value of x* is calculated os above. 

2. The number of different letters, n', corresponding to a total of JV+iV' letters is 

obtained from the curve. 

3. In the tables for x* the colunm n' gives the probability coireqponding to the calcu- 

lated value of x’. 

This probability expresses the chance of getting another set to yield a value of x? st least 
as large as the one obtained; in other words, the closeness with which the two alphabets match. 

T.ct us proceed to find the value of P for the two distributionB shown in figures 13 -and 14. 
Wo first find the differences between the weighted frequencies in the respective Bupetimposod 
sequences and then square the differences. Thus; 

> Pcanon, Sari. On the pnbabilitg that two indepmdetU duirihuiiont qf fnigamey ttnrtoUy Mn^to/roM 
the tamo population. Biometrika, voL 8 (1011), pp. 250ff. 

* See, S. KuUbaek, loo. eit.. Table IV and ohait 1. The table and chart an reprodueed oni pagea 01-84 of 
thiapaper. 
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Next, wo must divide each difforonce squared by the sum of the cotrespondizig origmal 
frequencies. Thus: 



Tablk XV 
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To find X* we must now add these quotients and divide the sum by NXN* or 22X36. 
The sum of the quotients, 18,628.8, when divided by 792 gives x'™23.6. In this particular cose 
the further refinement was not applied. Since there ore 22 segments we look up the table for 
x‘==23.o and n*—22 and find .32 which meazia that in 32 of 100 cases wre would gat by chance 
a match as bod as or worse than that observed. 

Wo would ordinarily then go through exactly the same steps for all the other sequences in 
our table \TII against tlic Y sequence: the sequence vdiich gives the greatest value for P would 
be the correct one and would indicate wiiicli letter follows Y in alphabet 1. But it will be 
unnocossaiy here to go through all the calculations since it is dented only to show that this 
motliod will give more trustworthy results than the method of coincidences and nonemneidenoM 
presented above, and we shoU, therefore, sliow only a few coses. 

Lot us take only those five sequences in table VIH which when tested ogainst the Y se- 
quence by the previous method gave the greatest indices of ooimadence, viz, the P, Q, N, L, and 
j sequences whudh gave indices of coinddonce of —.12, —.23, —.24, —.28, and — .29,.zespep- 
tivdy. Tlie data for these sequences upon the method of squares are as follows:. . 
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The values of /’ (index of probability or colccidcnec) sboiv thai tha probability that the 
P sequence belongs one !:ucni-al to tho left of the Y soque:u;o is a littio more than 1.7 times as great 
as tliat of Uio nearest rival to tlie P sequence, viz, tiie N saquonco. The result is quite gratifying. 

We sli.ili shou' tmuihcr ease. For this we .shell cliooso a case in which a terthuy trial had to 
bo made on tlic iirovious mctliod — t1i.at is, tho inntliod in which a trial with sequences only one 
interval and two intervals removed failed to give denuito and clcoivcut results necessitating 
a third tost with a sequence tlnec intervals removed. Note tho diiDculty in finding tho letter 
whidi follows K in table VIII. Hero even the s:*.cond trial foiled to show whotlicr D, H, Q, or 
Z follows K. Lett us test those sequences ngnuist tho K sequence usii^ tho method of squares. 
Tho data arc as follows: 



Tadui XVII ' 

Total 



! 


2 


2 


B 


B 


5 


1 


2 


1 


4 


8 




B 








1 




2 


a 


B 


1 


1 


B 


B 


I 




B 


■ 


K 


h" 






3 




1 


1 




2 


1 




6 






r" 

4 


2 




4 


s 


B 


B 


fl 


fl 


B 


§ 


B 




1 


2 


1 




» 




1 




1 


4 


1 


B 


1 


1 


B 


1 




lH 




1 


B 


1 


1 


B 


gj 


]d-31.7 


P-.42 




K 








3 


5 


1 


1 




2 


1 


1 


B 


1 


1 


B 


B 






5 


1 


1 






B 




a 




4 










8 


2 






1 


1 


B 


1 


1 


fl 


B 


1 




1 


i 


1 


1 


1 


B 


u 


OH 






X 








8 


5 




1 




2 


1 


1 


B 


1 


1 


5 


2 




4 


5 


3 


D 




1 


8 


89 


Pi 




1 




4 


2 


1 




J 


2 


a 


! 


1 


1 


I 


1 


§ 


1 


a 


5 


B 


1 


fl 


1 


B 


82 


x»7ir.r 


P-.65 




1 






3 


fi 


1 


1 


B 


2 


1 


1 


B 


B 


B 


B 


B 


1 


4 


& 


fl 


B 


1 


B 


B 


89 



The difference between D and Z is not conclosivo enough to decide definitely yet, but H and Q 
can bo eliminated at once. Testing for interval 2 will’ show that D is the proper letter. 

It is to be noted that for our purposes it is not even necessozy to find the actual value of P; 
for given constant in value, P increosos as x* decreases, not in a regular manner it is true but 
approximately so. Therefore, if n* is constant throughout a series of colpulationB, the index of 
coincidence in our case will vary in a general mimner inversely with the value of x*i the lowest 
value of X* indicating the greatest degree of ]»obabili^ or the greatest index of coincidence. 

It is behoved that this method of squares will bo found exceedingly valuable in many other 
cases where considerable refinement of method is nocessazy to pzoduoe dloar^mt reeulta in fitting 
frequency distributions to one another. 
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TABLI? IV.* Teatjor Gowlnf«» of Fit. Values of P. ' 





n'=>3 

■ 


B 


B 


B 


B 


B 




■falO 




1 


■G0G331 


■801253 




■0C256G 


■98S61S 


■004830 


■ppilB 


m 


■0008SS 


n 


■807873 


■573-107^ 


■735750 


■840146 


■010CS» 


■050840 




6014CB 


mm 


s 


■S93180 


■301(:S5 


1557825 


■609086 


•808847 


685003 


634367 


664205 


•0S1434 


4 


■135335 


■3Giiaii 




‘54041G 


■C70C76 


•779778 


657133 


61U13 




D 


■08S0S5 


•171707 


■287293 


•415SS0 


■543813 


■059063 


•767576 


634308 




B 


■010787 


•IIICIO 


^90148 


■306310 


•423190 


■530750 


■647333 


■730919 




D 


-080197 


■071807 




■220C40 


■3208-17 


•42SS80 




■037119 


BPffr 


8 


-018310 


■ouxns 


091578 


■1SG236 


■238103 


■332504 


■433470 


•534146 


628837 


0 


■011100 


•030301 


■001009 


■1000G4 


'1735';8 


■353656 


648S9C 


•437874 




10 


■00C73S 


■018566 


•0404S8 


•075235 


•IS loss 


188573 


665036 




•440498 


11 


■0010S7 


■■011790 


■036504 


■061380 


■088376 


138010 




■876700 


657618 


13 


■002170 


■007383 


017351 


■034787 


■061000 


100568 


161804 




685057 


m 


■001503 


■004037 


011370 


■033379 


043030 


■072100 


111850 


163607 




li 


-OOOOIS 


■002905 


■007305 


•015000 


O20C:iC 


■051181 


■081765 




178003 


IS 


■000553 


•001817 


■004701 


■010303 


■020250 




■059145 




1320G1 


IG 


■000335 


•001134 


■003010 


■00CS44 


013754 


■035116 




EB] 




17 


■000303 


•000707 


■001033 


■004500 


■000283 


■017300 


690109 


648716 


674364 


13 


■000123 


■000440 


■001234 


■003047 


OOC232 


611970 


■031826 


■035174 


■054964 


10 


■000076 


■000373 


■00078G 


■001022 


O041G4 


■00S1S7 


614860 






SO 


■000045 


■000170 




■001250 


■0027G9 


■005570 


610336 


617013 


630353 


SI 


■0000S8 


■000105 


■000317 


■000810 


■0018S5 


■003770 


■007147 


613650 


■031003 


S3 


■000017 


■OOOOG5 




■000524 


■001211 


■003541 


■004916 


•008880 


615105 


S3 


■000010 




■000187 


■000338 


■000700 


■001705 




•006107 


610747 


S4 


■000000 


■000025 




■000217 


■000522 


601130 


■003303 


604301 


■007600 


85 


■000004 


•000016 


■000050 


000130 


O0(»41 


000750 


601564 


603071 


•005345 


se 


■ooooos 


■OOGOlO 




■000000 


■000223 


■000504 


601050 




■003740 


S7 


■000001 






■000057 


■000145 


■000333 




•001300 


■003004 


SS 


■000001 


•000004 


■000012 


•000037 


■000004 


■000330 


■000474 


600054 


•001805 


SO 


■000001 




■000008 


•000023 


■000001 


■000146 


600317 




■001346 


SO 


•000000 


■000001 


•000005 


■000015 






■000311 


600489 


■000867 




■ocoooo 


•000000 


■000000 




•GOOOOl 


■000001 




■000008 


•000017 


1 so 


■030000 




fcTiXrMB 




Ki • • I't'ii 


■000000 


600000 


600000 


600000 


1 

so 


■OGOGOO 












■000000 


■000000 


■000000 


1 90 

L._ 


■ocoooo 












■000000 


■000000 


■000000 



■ CoiiUhI ImB ZWIa /ar fltatMMMH «ii< Bhmdrklnu, BdlUd by Khd Pmood. Fait ]« SaA Bd.. CamMdii 
VaiVTiatty. ^ 
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TABLE IV — {eoTiiinTudi . 



^ I b'bIS n'BlS h'bII I n*a!5 K'slG n'alT flTaM | k'slO ft's20 




>i wy.'* 'iiiirl 



•S00774 



■0007SG 
■809917 
-9311G7 
■6733G5 
S 

8 713301 

9 ■021882 

10 -530387 

11 -443SG3 
13 -308043 
13 -SOSSOO 
U 

IS -182493 

10 741130 
i7 -10:1876 
IS -031581 
19 -001004 

50 -045341 
31 -033371 

■084374 
S3 -017670 

51 -018733 
-000117 



£7 -004585 
S3 -003338 
39 -002270 
SO -001585 



-895544 
■083430 
■067978 
■010033 ! 
■657013 I 
785131 j 
702031 I 
■616000 
■688010 
-445030 
■3GQ041 
■900708 
■841430 
■101230 
■140507 
■115001 
■088529 
■067080 
■050380 
■037620 
-027786 
-020341 
■014822 
■010734 
007727 



PTikTiW 



■901101 

-076103 

■940153 

■902161 

■843001 

772043 

■093934 

-010317 

■027043 

-4-1781S 

•373814 

■307354 

■240129 

■190304 

■157620 

■123104 

■006210 

■078020 

-056302 

■041077 

■031130 



■003999 

■990017 

■009074 

■0954C0 

■985818 

•000:91 

•9.H711 

•&150337 

-831051 

702183 

■CSC03C 



-S-20524 

^l4^711 

■373101 

313374 

•255178 

■806781 

164049 

■130141 

■101633 

078614 

■000370 

■045882 




■007737 






■957060 




1* 

■OSS90O 

■090830 



■005754 



■973200 

■948807 

■013414 



76ii504 


■600485 


■079033 


748080 


■C02398 


■672758 


■G23620 


■608714 


-451418 


•534633 


■3S2051 


■452001 


■31SS04 


•365507 


■303006 


■8S3S07 


■813734 




771033 


■230220 




•178510 


707804 


743181 


■084140 


713735 


■005003 


089504 



•002793 



■017001 -025887 
■012441 -010254 

■014333 
■000546 I -010150 
■004710 -007032 






•038730 

■031560 

■010085 

■011031 



■041483 

■031630 



1* 

■990097 

■990031 

■999483 

■997771 { 

■903187 

■9S3349 J 

•0CW17 ^ 

■040261 

■003010 

■650504 

■6C013C 

730186 

■007102 

■593482 

■58383-4 

■4543G6 

■383841 



■274889 

■220201 

784710 

740851 

719435 

•094710 

■074461 



•039872 ! 

■909703 

■90SSGO 

■086107 

■903125 < 

■078637 j 

■050743 ; 

•031000 

«04357 

■847837 

701573 

729091 



•503547 

■533103 

455053 

‘391623 

■S3S819 

•279413 



790580 

765028 

734015 

■090758 

•078900 






■034528 

■020345 



■048379 

■087440 



■284258 

■237842 

780102 



•130188 

■104653 

■083488 

■066085 

■051788 



iO -000036 OG0073 -000138 ■000256 -OOOOOS -GOOTTS <Xn294 -009087 003272 

CO -000001 -000001 -000003 -000000 -000012 -000083 -000043 -000075 -OOOUl 

60 -000000 -000000 -000000 -000000 -000000 •000001 -000001 

70 -000000 -000000 -000000 -000000 -000000 -000000 -000000 I <000000 
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■OOOdSO 

-9998G8 

■090137 

■998142 

■09Q143 

■089314 

■078012 

■0CS787 

■039017 



■099093 

■990030 

■09970S 

■998080 

■007100 

■093331 



221 



■609309 



•760650 

■711108 



\m 



3^ n'ir21 n'-S3 M's 



!■ 

1* 

■990098 
4 I ■099934 
■9sW23 
-J0s.=^ 
■90GG35 

8 -OOISGS 

9 iiSSiXff 
JO -OCSITI 
IJ ■04CS23 
JS -910070 

■877384 
J4 -SSOIOO 
JS ■770408 
J8 •710024 
J7 •052874 
J8 ■387408 
J9 

SO iiSTOSO 
81 -397132 
•S40S11 
SS I -SSSTOS 

S* 

SS 
SO _ 

S7 ■135284 

sa ^09399 
£8 -CS7759 
SO -000354 



TABLK lY — {eauthiutdi 



if-24 aTaOS i^«30 «'b 97 af-QS I aT-fO aT-SO 



■07 
■057379 
■031 
■00 



!• 

1* 

1* 

■099007 

■009973 

■OOOS65 

■000462 

•998371 



■091277 

«83189 

■970470 

•951900 

■0SQ871 

•804834 






■763302 



1* 

■000090 

■009037 

•009039 

■900711 

■099085 

■997505 

■094347 

■039012 

•07S90S 

■9GC121 

■040050 

■020759 

■888078 

•64SG03 





r29 
•247164 



■170SS3 
440161 
•114003 
•09 



476681 

•144801 

‘118404 



•767489 

•701324 

•641912 

•681087 



•400771 






■850286 



■216781 

480310 

449402 



•751090 

■896778 

•038726 

•579207 

•519708 

•461597 

40S700 

•353185 

•384453 






•230131 

484753 



X* 

1* 

■090994 



■099851 

■999494 

■998590 

■09GG53 

•992940 

■080507 

•076501 

■001733 

■041383 

■014828 

•881793 

•843390 

■7071S0 

■746826 




■0D0720 
■000194 I ■099548 
■007081 -OOOBOO 



1 * 1 * 

!• 1 * 

1* I- 

I- 1* 

!• 1 * 

■009007 -OOOOOO 

•900091 
•090024 I -99 
■990748 ' 



.,1 






73 
■083074 
■073000 
■957334 



•894804 

•080247 

■881264 

•969433 

•963947 

•931122 



•998316 

■906372 






■937189 

■978436 

•905819 



■997738 

■995384 

■991377 

■965016 

•076638 






■030149 •044272 



■635744 

•677504 

•510373 

•403373 



•353884 

■307863 

■303016 

•234289 



•791958 

•741984 

•686687 

■632947 

•676806 

‘518875 

■403105 

■409333 

<U8468 

•311082 

■207611 



•870001 

■630766 



•804464 






■737377 

•886013 

•630316 

•674482 



•410973 

•800899 

•314164 



•781291 
■733041 
■68 
•827886 
•678046 
*618247 
•404447 




•859149 



■776343 

■728933 

■G7SS48 

■036401 

•571709 

•517913 

•483066 

•414004 



40 -004095 -007437 -010813 4)16309 •031387 •039164 -OSOOIS 4)51227 4)66128 4)63837 

SO -COOSSl 4)00306 4)00586 4)00921 4)01416 4>C6a31 4)03144 4)04651 4)0646^ 4)09032 

00 4)00007 4)00013 4)00032 4)00038 4)00004 4)00104 4)00168 4X)Q804 4)00407 4XX>618 

70 4X)0000 4)00000 4)00001. 4)00001 4)00002 4)00004 900007 4)00011 4)00019 4)00030 





U«>tOO— 3S S 
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Part II . 

THE -SCHN2IDER CIPHER* • 

Details of Enczprbricbmt 

This system aims to make the entire operation of encipherment end dodpherment dq>end- 
cnt upon ^0 knowledge of a angle key word agreed upon in adyonce by the corroapondents. 
Let us suppose the word tobeTREBIZOND. 

An iirbitrorily mixed alphabet is derived from a generating rectangle based upon this 
key word, ivith a slight departure from the usual method. Instead of inaftrihing the letters in 
the generating rectangle in the key-word sequence, as is usual, the initial letter of the second 
lino may bo any letter except one in the key word itsdf. The remaining lettmh' then foUow in 
the key-word sequence! Thus suppose wo eliooso K as this initial letter for the second line. 
Our rectangle is constnioted in this manner: 

fiGun 27 

TREBIZOND 

KLupasuvir 

XYACFGHJ 



By taking the columns in succession and writing them in two lines of IS letters each we 
have tha following: 

Figubx 28 



Alphabet ^ | q 



KXRLYEUABPCI 

FZSGOUHNVJDW 



Those two lines constitute alphabet 1, in which TssQ, K^F, Xs=Z, etc. The values 
are all reciprocal. 

All the other olphabote ore derived from alphabet 1 by the simple oxpedioat of carrying 
the upper half of alphabet 1 to the third lino and revolving the sequence one letter to the rig^t. 
Thus: 

Fioviifl 20 

|(1) TKXRLYEIIABPCI 
Alphabet I|^2) QFZSGOUHNVJTDWl Alphabet 2 
(3) ITKXRLYEHABPCj 

The jiixtoposition of linos 2 and 3 rosulhi in the formation of olphabet ’2, in which Q»I, 
F=T, Z-=K, etc., also completely reciprocal. 

' Scliiii'iilnr, L. Dfneription d'un nynlhtM erujJnijra'flhiijue A I’wngc do I’annit. Purls, 1012. This cipher 
TTUh K::liiiiii.ip:!il foe oxainination by Moj. Otto IFnlstcin, .M.I.-llcs. who fiircislicd the c^ter, who was tlkca 
at till- lUvoriii-.iik Idlwratorios, with a tnvnslatlon of CoinsuariiJant Sohuutder's lupcr. Attempts to locate 
in Aii-iurii-Mi lilirarius and bookstores the work cited to sco if tlio inventor lioil offered a samplo message for 
solutuiii i-nived of no avail; fortunately, a copy of tito papor was scut in quite accidentally by an olMCuro book 
dtuter shorliy after ttiu first draft of this manusorlpt was prOporud. No .sample nuiSBaEB is given. Upon xny 
request an u>sist.iiit prepared tlio message which will presently be given, and the sidution of which was attained 
by the iirinciplcs to be elucidated. 



( 05 ) 
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Alphabet 3 is conBlniel4Ml by moving lino 2 to lino 4, rovolving the hri{iic.iico lisa loltera 
to Uio right. Thus: 

Fii.URC 30 



Alpliabot 1 I 
Alphabet 8 | 



TKXRLYEUABPCI 

QFZSGOUHNVJDW 

ITKXRLYEUABPC 

OWQFZS60UMNVJ 



Alpbabot 2 



The juxtaposition of lines 8 and 4 results in tho formation of alphabet 3, in which 1=0, 
T=U, X=Q, etc. 

Continuation of this process can rosult in tho production of a total of 18 different secondaiy 
rcdprocal alplmbots. As a rule only a linutcd number of these secondary alphabets are 
employed, usually not to exceed tlio first 6 or 7. 

Gnc of the mnin features of the motlivid of oneipliennent is tliat groups of uiiMiiinl Icngtlis 
fire euripl.ercJ in cyclic fashion by means of sorer:'.! niphabets. That is, tho text is broken up 
into groups containing 1, 2, 3, . . . letters enciphered by different alphabets, the groups 
being repeated in sets, as explained below. 

Let us, as one of tho corrosx>ondents, doteimino that the cycle ia to ooneist of ux groups. 
Taking 'the first 6 letters of line 1, that is the upper ludf of al|diabot 1, their numerical values on 
tho basis of their relative poritiona in tlio normal or straight alphabet, are oa followe: 

T K X R L Y 
4 1 5 3 2 6 

This sequence of numbers, 4-l-5-^-2-6, constitutes a qyclo designated hereafter as riie 
“Interruption kqy.” It partakes of tho nature of an “interrupter” in that it dictates tho 
number of lottors in each of the groups of irregular length treated in endphorment. Thus, the 
first gimip contains 4 letters; tho second, 1 letter; the third, 5 letters, etc. The seventh group 
would begin a repetition of the cycle, and it woiild contain 4 letters; the eighth group, 1 letter, 
etc: This cycle is repeated many times witliin a message. 

Endpheiment witliin each group is regular in tho suocosdon of the alphabets employed. 
Thus, if tliroe alphabets are dotoiminod upon by tho coirespondcnta, the alphabeta would be 
employed in tlio following sequence in the interruption key given above: 

Fmobb 31 

Length of group 4 1 5 3 2 6 

Alphabets 1231112 31212312123128 

Note that in groups containing more letters than the number of alphabeta dedded upon, 
tho sequence of alphabots ia repeated. Thus, in the 5-letter group, we have the sequence of 
alpliabeu 1-2-3-1-2; in the 5-letter group, l-2-3~l-2-3. Endpbennent then proceeds by 
alpkabcts according to the distribution of numbers within the groups. For example, group 1 
oontolniug the soquonee of numbers 1-2-3-1, the first letter is andphered by means of alphabet 
I ; tlie second, by alphabet 2, and tho third, by alphabet 3. The fourth letter, however, is again 
endpliercd by alphabet 1. 

After cnciphermont, the order of the dpher lettore within the groups is reversed throughout 
tho nicssago. Thus, suppose that the ondphoiment usiog the three alphabets and Issy above 
were us follows: 
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Fioimn 33 



4 


1 


5 


.3 


2 


fi 


4 


1 


5 


3 


13 3 1 


1 


1 3 3 1 S 


13 3 


1 3 


13 3 13 3 


13 3 1 


1 


I 3 3 1 3 


1 3l 3 


E N E M 


y 


I S I NT 


REN 


C H 


I NG A L 0 


NG WE 


S 


TERNS 


LOP 


U 1£ 0 H 


0 


W X D A F 


SHB 


D E 


WMYNOE 


A R T U 


R 


QHZ AX 


GL V 



Tho lottoTB within tho cipher groups arc tlion roversod yielding the following: 

FiacBE 33 

HOMU 0 FADXW BHS ED E 0 N Y If W UTRA R XAZHQ VLG 

It is now necessary that the information necessary to decipher the message be conveyed 
to the recipient, who, of course, is already in possession of the key word. This information is 
transmitted in tlio form of on " Indicator group ”, consisting of 3 letters, whose location within the 
message has been previous determined in a manner to be explained presently. The first letter 
of tho indicator group gives tho initial letter of tho second line of the generating rectangle; the 
second, the number of alphabets employed; and the third, the length of the interruption key, 
from which its sequence can bo derived from the upper half of alphabet 1, as already explained. 

In the cose above, the initial letter is K. The number of alphabets being three, the third 
letter in the upper half of alphabet 1, viz, X, forms the second letter of the indicator group. 
The length of the interruption k^ beii^ six, the sizth letter of the upper half of alphabet 1, 
viz, Y, forms the third letter of the indicator group. The knowledge of this letter enables the 
redpiont to construct the interruption key. The indicator group for tiie message above is, 
therefore, KXY. 

This indicator group is then inserted in the ciphor text in a position dotormined by a pr^ 
viously agreed upon letter of tho key word, usually dther tho first or the last letter. Thus, if tho 
correspondents agree upon tho first letter of tiio key word, this indicator group would be inserted 
after tho twentieth letter of the cipher text, because T, the first lettw of tho key word, occupies 
tho twentieth position in the normal or straight olphabet. The cipher message above would 
read, therefore, as follows: % 

SIESSAOE 

HOJIUO FADXV BHSED EONYU KXYVU TRARX AZHQV etc. 

By varying the elements of the indicator group, a great number of combinations can be 
obtained from one key word. Schneider says: 

With a of 6 lotton, if one aoU out to modify tho taUo of alphabeta; one obtains only 
different ci?*ptognuna from the same test. 

If, in addition to the toblo of alphabota, ono modiiios tho number of alphabcta employed and the number of 
lottcra to fo.-m tho numerical key, aaauming that one va^oya 3, 4, 5, or 6 alphabets (a total of 4 oomUnations) 
and 5, 6, 7, or 8 letters to tom the numorlool key (also a total of 4 oombination^, one would obtain 20 X 4'X 4—320 
different oij'ptograms from one and the seme tenet 

Wo shall first elucidate the principles by means of which a single message may be solved, 
and then proceed to tho solution of a series of messages in the same key. 
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1 . Solution of 1 


a single message. — Given the fallowing message: 






KKMVL 


IQKGK 


NNHKV 


QEEXK 


XYXOP 


MSIEE 


TPDKU 


LISYZ 


HHBRE 


SATHR 


kz.rg:j 


NJGKD 


QKWM 


FQBKE 


NEIHA 


EAAliE 


KHFLW 


ZRKEO 


.KUHFM 


WAFTW 


ESPSB 


DDGV/P 


JPXGD 


ZVWUX 


LZAYU 


ENILH 


AIUUA 


EABRE 


PWKFV 


JJKIP 


' EiJENF* 


SBVYZ 


KWDtiK 


VLODO 


OBFISB 


BYyOQ 


YVKVI 


XBGIK 


HEONE 


EIWKV 


PAECL 


RNIHN 


NMODQ 


NIKAO 


JKEOZ 


TCFKD 


JJODV 


ZUAES 


ZDtfXG 


KXBDT 


XNKGD 


IHTKS 


NPGDU 


RQFUD 


ONAZA 


ZMWCL 


RHEOF 


ZVFHV 


KZYGE 


VPNPK 


AQAHLl 


DCCKK 


UGDJW 


ILIAB 


ECESG 


SPFEP 


KOPIS 


HHODN 


D3KFI 


SYQUZ 


KGIQQ 


KTWYG 


JV/KPK 


NVBHE 


ULJVQ 


ZZWIU 


LWVNJ 


UCDQQ 


KKFDG 


WPUFS 


KIEBC 


CDXLP 


tiDODO 


EAKKH 


UHFZT 


CtW? 


WDAVL 


UPXKR 


■ OJKFN 


UVPRA 


0E7/J0 


JWWR 


UDEWA 


FYWHX 


ENtiEN 


APRNP 


VCUCL 


HYFMN 


LKHVP 


NOJKF 


ffWCLR 


AOOffZ 


VJWW 


RMDEff 


AFYffO 


XENGZ 


FKVWP 


XUNUU 


AYMGX 


VNKOJ 


KFSV3 


ZRKLE 


lONGM 


NVFTD 


HGBJO 


LIPOW 


NPPWK 


NREPM 


DnXYE 


ZNZON 


UEVOP 


WUQQQ 


JNUPV 


FQdWN 


KKOLI 


POUilKO 


VBHFY 


UEPUD 


AFlffiiP 


ABEFD 


UKrri'fiJ 


EODRL 


ISYDK 


VZXffK 


UIQXK 


THYNS 


VHEQA 


AEVIX 


PZTQK 


lAUFT' 


TALOV 


JKVHD 


UETCL 


UHVWR 


HFUNN 


GFCKK 


ESEYE 


JUPffW 


BC 





Fbikciples of Solution' 

Thia ciphor boloaga to tlio class of combined substitution-traiisposition' me&ods. The 
substitution, however, does not follow the method of the ordinoiy periodic multiple-alphabet 
system, nor is the transposition regular or geomottieol in its nature. We aholl eonsidcr first the 
results of tlio method of substitution. 

While ot first glance it may be thought that in tliis system, the encipherment of the groups 
of diftcront lengths eliminates the regularily or cyclic nature of the ordinaty periodic multiple* 
nlphabot cipher, such is not strictly the case. Xt is true that the groups are irregular m length, 
due to the action of the interrupter, but .at the same time, since the interruption key repeats 
itself miuxy times throughout a message, then wiU hi ngidar and (^ie npetUiona of eonstant oets 
of ffrowpo. In other words, tiiere is a cycle in the system, composed of a constant number of 
groups of irregular lengths. 'We shall first proceed to determine the lengtli of this cycle. 

'Hit Ungllf. of this ej/de ia.depmdeni upon tlus length of iho interruption key, and is the sum of 
the constituent numbers qfthe key. It is obvious tliat tlio key can be 2, 3, 4 . . . up to 13 numbera 
in length. It cannot be more than 13 because of tlio manner in which the key is derived from the 
upper half of alphabet 1. (See p. 66.) 

Now the numbers in those interruption keys do not repeat themselves,, and it foUows, 
therefore, os a simple m.atlicmatical fact, that the sums of the numbers composing every possible 
key — in other words; the lengths of' the possible cydes — arc constant and determinate quantities. 
Thus, if the key consists of 2 numbers, tlio cycle will be l-4-2=s8 letters in lengtli; if it consists 
of 3 numbers, the eyde will be 1 +2-}-3 = 0 letters in Icmgth. The following table gives ^e lengtii 
of all possible cydes of tlio entire system. 



Tabu XVJIf 



1 {.engUiatlnv 


lAOGtliiilerdo LoaettaoCkw 


L«n|Uio(oyalB 


1 

i n 


a il s 


28-(- 8=>SG 
86-(- 9^43 
4S+10^Sa 
S5+U^6d 
66-l-12>-78 
78-1-18-01 


! 3 


3— 3.S C 


<1. 


1 4 


di-l-IO 


Ill . _ . 


1 

_ . _ 


11 _ _ _ _ 


fi 


lS+0--=-31 ! 
31-i-7»28 


lO- 


7 _ 
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In orfisT Id dott’rmii’.o ilic eorront c.'vi-.' -er.:';!. ■vi' proceed ajiply the coincidence method 
doseriord i:: i !.-■ pnet of pnri 1 of lliis p.i|x-r. Ti:i: me3s;i;;c is wvittcii out in lines of 3, 6, 10, 

lo . . . lor« in len^'ili to correspond (o liiO virions liypothcses regarding length of cycle, as 
given in ; abu' XVii I . 

We. must iirst take into consiclemtion, Itowcvcr, t!ia fact that ^mowhere within the body 
'of the jue.saago thorc is u group of 3 lettcirs, compasing the indicators of the message, and since 
these 3 Icttci-s do not form a part of any cycle, they might throw all the cycles succeeding- them 
out of pimso witli tixe cycles preceding tlium. .Bu< , from the method given. for fixing the location 
of the indicator group, tiie latter must nci.uii* sumewiiero within the first 26 letters of tlxe' cipher 
message. By discarding tlic fimt 26 letters of our me.sNago then, we may be sure that we have 
cliniinaied this source of distortion. Starting with tlio twouty-eeventb letter of the message, 
viz, S, wc proceed to sot up tlie mcsssigu according to various bypotbotieol cycle lengths. We 
may omit the lengtlis 3 and 6, at the start u.^ higu^ improbablo. In this cose no table of coinci- 
dences is really necessary beoause the arrungoment of the message on the assumption of a cydo 
of 28 letters gives so many repetitious witliin tlio same columns that further corroboration is 
unnecessary. Note the repetitions of long polj'graplis below: 



Ficcas 34 

1 s I 4 t • 7 a gioiii3i3uui6i7!8i9 29naaiMM»zra 

SIEETPDKUL I S Y ZHWBR ES ATHRKZRG 
MNJGKDQKVVMFQBKENEIHAEAAMEKH 
FLVXRKE0KMHFM(7AFTI!r E_S P E B D D G W P 
JPXGDZVWUXLZAYUENILHAIUUAEAB 
REP WKFVJJKIPEMENFSBVYZKWDMKV 
LODOOBFMBBYWOQYVK V I X D G I K H E 0 N 
EEIWKWPAECLRNIHNNMODQNIKAOJK 
EOZTCFWDJJODWZAUESZDWK’GKKBDT 
XNKJDIHTXSNP6DURQFUD0NAZ.AZMW 
C L R H E 0 F Z W F H V K Z Y G E Y P N P K A Q'A H E D 
CCKKUGDJWILIABECESGSPFEPK'OPI 
SHHODNDMKF I 3 Y QMZKGLQGKTWY.GJW 
N P X N W B H E U L J V Q Z Z W I U L W W N J U C .D Q Q 
KKFDGffPUFSKIEBCCDXLF-HDODO:EAN 
K H M H F Z T C P M P P W D A V L M P .X K R 0 J K F N K 
VPRAOEWJO J V V ff R K D E W A F Y W H X E N U E 
NAPRNPVCUCLHYFMNLKHVPN 0 J K F ff W 
CLRAOOWZV JVVWRMDEWAFYB 0 X E N G Z 
FKVBPUXNUUAYMGXVNKOJKFSVBZRN 
LEIONGMNyPTDHGBJOLIPOWNPPWKN 
REP MDWXYEXNZONUEVOPWUQQQJNUP 
VFQJ-WNKKQLIPOWKOVBHFYMEPMDAF 
MHFABEFDMKHWUEODRL'ISYDKVZKWK 
MIQXKTHYNSVHEQAAE V I X PZTQKIAM 
FTTALOVJKVHDUETCLUHVWRHFUNNG 
FCKKEESYEJUPnWBC 
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Now, acrording to t-nblo XVIII, a cydc IciigUi of 23 roquiros that tho interruption kny consist 
of 7 mmibrrs, obviously tho digits 1 to 7. Wc do not know how many alpliabots aro involved, 
but tho}' probably do not oxcoed 7. If only 3 alphabets ore involved, then they would be dis- 
tributed in 7 irregular groups os follows: 

' Tabu XX. — Bania nj 3 alpialulM 

amuiM Alphabati 

1 1 

2 1—2 

3 i— 2 -a 

4 1—2—a—l 

5 1—2— a— 1—2 

6 1—2— 8—1—2— 8 

7 1— 2-*— 1-^— »— 1 



Alphabet 1 would be employed twelve times, olpliabet 2, nine times, and alp^bet 3, seven 
times. The following data pve the number of times the various alphabets woi^ be employed 
on diflerait hypotheses: 

Tablx XXZ 



Sasia ef 4 dlpkabda 
Cmupi AlphalNti 

1 1 

2 1—2 

3 1 — 2—8 



Batit efB alj^iabatt 
Gniipi tWwhtti 

11 
2 1—2 

3 1—2—3 



4 1— a— 8— 4 

o 1—2— 8— 4—1 

6 1—2— 8— 4 — 1—2 

7 1— 2-8-4— 1—2— 8 
VrariucneiMi 

Alphabet 1, 10 times 
Alphabet 2, 8 times 
Alphabet 8, 0 times 
Alphabet 4, 4 times 



4 1-2-3— 4 

5 1 — 2 — 8 — 4— 5 

6 1—2 8 - 4 - 8— 1 

7 1— 2-8— 4r-5— I— 2 

VmiiMiulii 

Alphabet 1, B times 
Alphabet 2, 7 thnes 
Alphabet 8, 6 times 
Alphabet 4, 4 times 
Alphabet 6, 8 times 



Basis 0 oIpAetstf 
Gnupi Al|dislN(i 

1 1 

2 1—2 

3 1—2—3 



1- 


-a- 


-8- 


-4 






X" 




-«SI_ 


V 

.d _ 


..JU 




X“ 

1- 


-2- 


-3- 




" o 
- 6 - 


■8—1 



RwiiieiidN 

Alphabet 1, S times 
Alphabet 2, 6 times 
Alphabet 3, 5 times 
Alphabet 4, 4 times 
.Mphabet 5, 3 times 
Alphabet 6, 2 times 



Onapi 

1 

2 

8 

4 

6 

6 

7 



Basis tf 7 olpAoMs 




1—2 



1—2—3 
1— 2-8- 4 
1-2-8 - 4— 6 




Alphabet 1, 7 times 
Alphabet 2, 6 times 
Alphabet 8, 5 times 
Alphabet 4, 4 times 
Alphabet 6, 3 times 
Alphabet 6, 2 times 
Alphabet 7, 1 time 



Wc shiilL now tiy to determine how many alphabets were employed, and how tiiqy are 
dislrib'.itcd within the cydo. Tho basis for tliis determination rests upon the possibility of 
comporiug tiic various frequency distributions produced by each olphabet and detoimining 
wbidi ere si;nilar. In other words, we proceed to dasdfy the various frequency tables into 
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sovornl ^rou]iSj tlio ninnbor of different! ^-oups correspondii^' to the number of olphnbets em- 
ployod in tlio nnvsn»c. According'!}', \m soliict one of the froquenqr tables os the most dosoly 
approximating that produced by a single mixed alphabet, and tiy to find others whidi seem to be 
identical with it, as evidenced by the index of eoinddence. Since there will be more than one 
'alphabet in ci\ch group, \ve sliall look for more than one high index of coincidence. For example, 
if vre find three indices which are much higher than the renuiining indices, we miqr condtidc that 
the alphabets corresponding to tlioso indices ore identical with the alphabet wluch is bdng 
used for comparison. 

Hoturning to table XIX, and following the reasoning g^ven on page 37 of the first port of this 
paper, columns 10 and 12 give the dosest approximationB to a theoretical sin^e frequem^ table, 
and wo shall start fitting frequencies with them as bases. 

Taking the frequenqr table for column 10 and applying it to that for every other column, we 
get the eoinddence data shown herewith: 



Tabus XXII 



ColuDn.._.. ..... 


n 


n 


n 


n 


D 


a 




n 


D 


U 


D 


D 


u 


u 


Total oeeuircnees .... 


52 


52 


52 


52 


1 

1 

52 


52 




62 


1 

52 


52 


. 52 


52 


52 


62 


Coincidences ........ 


17 


10 


9 


7 


6 


5 


10 


11 


9 


8 


7 


8 


6 


8 


Dilfcrcncos. ........... . 


-1 


BB 


-25 


-31 


-34 


-87 


tm 


-19 


-25 


-28 


-81 


-48 


-87 


-28 


Indices of coincidence.... 


-.02 


-.42 1 


-.48 


■ 


1 

-.65 


-.71 


B 


-.87 


-.48 


-.64 


-.60 


-.88 


-.71 


-.64 


..... — .. 


— 


1 

“ 1 


17 


u 


19 


a> 


It 


D 




D 


D 


D 


17 


» 


Total oceuTToncca........ 




62 


51 


51 


51 


51 


61 


51 


61 


51 


51 


51 


51 


51 


CoincidciiccB. ..... ... 




6 


10 


16 


6 


11 


8 


6 


6 


8 


■M 


6 


■m 


8 


DilTcroiicus ...... 




-34 


-21 


-3 




-18 


-42 


-83 


-83 


-27 








-27 


Indices of eoinddence.... 




-.65 


-.41 


-.06 


-.66 


-.85 


-.83 


-.65 


-.65 


-.63 


-.59 


-.65 


-.60 1 


-.68 



The indices of eoinddence for columns 1 and 18 are so high« relative to those for all other 
columns, that we may condude at once that the frequent tables for these two columns bdong 
to the some alphabet as cdumn 10. To corroborate this calculation, let us consolidate the time 
frequency tables to see whether the result of such consolida'tion will present the appearance of a 
single mixed substitution alphabet. 

noTIBE 35 



ABCDEZ'OHl JSXiSXKO? 




OBSTTTVWZYZ 

^ I ■ . ^ § 



Consolidated frociuoney table of oolumaa 1, 10, and 18 



We note at once that this frequeoi^ distribution gives every indication of being that of a 
single mixed alphabet. 

Hoturning to the tabic of indices of eoinddence for all other columns tried with column 10, wu 
note that the indices for columns 1 and 18 are in reality so much higgler than those for all other 
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cohnims that \vn may assmiio with a fniT -.if ;.hat this throe cnhimns thus classifird 

conMituti’ all :he coiiiimsK appiyinp: to one alphahot. Itafeii'nce to the lUo^psam (table XX) for a 
basis of 3 alphabets shows ihat nlpliabot 1 is sisud 12 times, alphabet 2, 9 tisies, and ulpliabet 3, 
7 times. Now if wo arc correct in our assmr.ptioti tliat columoa 1, 10, and IS coiisiituto all 
the columns for ono alphabet, then Ibis assumption auloinnticallv rules out the hypothesis tliat 
tlio problem involves 3 .nlphabots because, on tho hitter hypotliesis, there cannot bo an alphabet 
w'hitih is used but three times; and, for tho same reasons, a iiy pothesis of 4 alphabets is eliminated. 
It is possible, however, to iuivo ou alpluibai. which is used but three times on hypotheses of 5, 
G, or 7 alphaliois. In c.icli of theso cases, tlio alphabet to which columns 1, 10, and 18 would 
bolor.ir is alpliahet ,■>. 

I.ct IKS proceed now to find another uliihabct to wliich si>mu of Uio other frequency tables 
belong. Tlic table for column 12, having Uiu same iiverago frequency as that for column 10,' is 
taken as a basis for comparison for the next •'-Jassification of frequonoy tables. We may omit 
from the calculation tho frequent distiibiidons for columns 1, 10, and 18, siuiee they have 
already been classified. The data for this tost are as follows: 

Tarlb xxni 



Coluaio 


1 


s 


4 


s 

1 


t 


1 


a 


• 


11 


u 


u 


u 


Total oecurrciioea ... 


52 


52 


52 


53 1 


53 


52 


52 


52 


52 


53 


52 


62 


Coiiieldciiccs 


fl 


13 


d 


8 


10 


Bl 


6 


7 


0 


4 


0 


4 


DiSfercuoea .... 


-25 


-13 


-34 


-28 


-22 


■a 


-34 


-31 


-25 


Ea 


-25 


-40 


Indieoa of coincidence:.-. 


-.48 


-.25 


-.65 








m 


2n 


m 




B 


-.77 


Cdiimn.. ..... — — 


D 


17 ' 


D 


n 


a 


a 


19 




D 


D 




» 


Total oceuiTQuoes 


53 


51 


51 


51 


51 


51 


B 


B 


61 


51 


51 


51 


Coincidcncos 


11 


6 


8 


15 


0 


10 






6 


0 


6 


10 


DiiTenncca— i ' 


-10 


-33 


-27 


El 


-24 


-31 


Bil 


m 


Bl 


-24 


-SO 


-21 


Indices of coiucideuop-. 




m 




s 


m 


BO 




m 


-.65 


m 




-.41 



Wo may with a fair degree of certainty condudo that tho frequent^ distributions for columns 
7, 12, and 20 constitute members of another set. But it is impossiblo in this system to have two 
alphabets which are used exactly the same number of times; and since we have already assumed 
that columns 1, 10, and 18 constitute the alphabet which is used three times, we ore forced to 
conchido that ono or more odditiozuil columns in this teat bdong with columns 7, 12, and 20. 
Now the index for column 3, viz, —.25, is tho dosoet to tho indices for oolunons 7 and 20, and 
it. is perhaps likdy that it bdongs with tlieae columns. But the index for column 24 is —.35, 
and tiiat for column 16, —.37, so ' that wo must apply a seoondoxy test. Lot us oonsdidato 
tho frequency tables for oolumns 7, 12, and 20, and then make a calculation of index of coin- 
ddcnco for columns 3, Si4, and 16. 
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Figure R6 



A B C S E Z' 



aEX JEX.K29TOP QBS TtZVWZTZ 



CouBolidatcd frequency table of columiu 7, 12, and 20 





t 


u 


■EM 




Fraquonoy... 


103 


103 




Colneidaiiena. . . _ 


21 


14 


IS 




—40 


— A1 


—48 


ludioca of cidnoidcncc 


-.89 


-.50 


-.47 



It is very probaUo that column 16 does not belong in the samo alphabet with columns 7, 12, 
and 20, but columns 3 and 24 do. To corroborato, lot us odd their eozresponding frequency dis- 
tributions to the consolidatod table. 

Fiauiuc 37 



ABCZZBPOEI J-KLZCarOPaBBTirVWZYZ 




Consolidated ftequeney table of columns 3, 7, 13, 20, and 24 

There is every indication that vre arc dealing with a distribution for a single mixed 8iphr.)>et, 
and that no foreign elements have been introduced. If we have succeeded in colloeting ?!, the 
columns which sliould belong to the alphabet being studied, it follows tliat tho latter must be 
alphabet 3, for it is used five times on a hypothecs of 5, 6, or 7 slphabets. 

Lot us now condder tho sequence of alphabet numbers which results when wo set down the 
alphabet to which each of the columns now dassLUed bebngs. 

Fiouns 38 

l-2-3-4-5-fr-7-8-9-10-ll-12-13-14-l&-16-17-18-lB-20-31-22-23-24-23-26-27-38 
53353 58 3 



Now it uill be remembered that in eedphonnont tho lettots in eadi group were reversed 
before grouping into fives for tdegraphio transmission. It follows, thorefoie, that the alphabet 
numbeni applying to roliunns when the messago is set up as shown in figure 82 will bo in groups 
of defending series of numbers and this descent is complete in eaoh case to number 1. Thus, 
for example, the series in the illiLstrativo incsaago on page 67 are as follows: 

Fiucur 39 

l-2-3-4-A-6-7-«-n-10 -1 1-12-1S-14-15-16-17-18-10-26-2I 
l-8-2-M-2-l-:{-a T-- 3- 2 - 1-2-1-8-2 1-3-2-1 

lletiiniing now U> figure 33, wo sco rhat our classifiRation of columns b in accordance with 
t'lo rooiiintinonb of descending series of nuiubois, and wo may oven fill in many numbois witiiont 
any fiirihor test. Tims, tho scquonco bocomes: 

Fiumix 40 

l-2-3-l-6-a-7-8-O-l0-ll-l2-i:<-14-15-10-17-18-l»-2q-21-83-23-24r-26-20^7-2S 
5— 1—3— 2-1 3-2—1— 5— 4— 3— ’<i— 1 6~ 4— i— 3— 1 3— 3— 1 



To corroborate thb soquenco, let us consolidate the frequency tables for columns 2, 11, and 
10 into one tablo; those for columns 4, 8, 13, 21, and 25 into a second taUe; and those for columns 
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5, 9, 14, 22, uiid 2C into a third Liibli;. \Vv. ilu this tu see if tlio resulting table in cuclx citsc 
corresponds to that of a single mixed nlphahot. 



I'lcniix 41 

Cdliuniu 2, 11, luul Iti 

ABODEVOKI jrKXi2XirO]PQB8T17VWZ7Z 




Coluinna 4, K, IS, 21, and 25 

ABCBEPOHX J’EI.MSrO?OB8TtrVVZ7Z 




Columns fi, 0, 14, 22, and 28 

ABCSSVOHl JKX.M1T OPQBBT17VVZ7Z 




The approximation to tingle alphabet distributions is, in each case, voiy close, and we may 
assume our work thus far to be correct. The only columns remaining to bo assigned to alphabets 
are 6, 15, 1C, 17, 23, 27, and 28. 

Xow column 6 con only belong to cithor alphabet 4 or alphabet 1; and the some alternatives 
present themselves with respect to column 23. Tliis is because these odumns, bting bounded on 
either side by columns already distributed into alphabets 1 and 3, must eithor be members of 
the descending sequence 4 3 2 1, or must bo tlio column forming that part of the interruption kqy 
which consists of the number 1, thus constituting tlie isolated column which must always bo 
enciphered by alphabet 1. In order to dotermino whidi of tlieso alternative astignments is the 
case, we may make a singlo calculation by finding the index of coinddenco when column 6 is 
grouped witli columns 5, 9, 14, 22, and 26, and when grouped with columns 2, 11, and 19. In 
order to make the results strictly comparable wo slrould include but throe colunms belonging to 
alphabet 1, bcimuso there are only throe columns astignod thus far to alphabet 4. Let us make a 
special consolidation of colunms 5, 9, and 14 for this tost. 

FiGUBn 42 

A3 CSSFOSI JSLlCarOPQBBTirVWZTZ 




Special conaolldatod froqueney table of coliuniiB 5, B, 14 
ABCBBPaEZ J-SI.ICNOPOBBTTTVWZ7Z 




Conaolidated frequency table of columns 2, 11, and IB 

Total frequency, columns 0, 5, 9, and 14. 104 Total frequency, (xdunui 6, 2, 11, and IB...... 103 

Total coincidcuccs, column 6 with 6, 9, and 14 20 Total coinoldoncee, column 0 with 2, 1 1, and 19.. 19 

DiCorcuco 3(2ij) — 104 —20 DilTcrcnco 3(10}— 108 —46 

Index of coincidence........................ —•25 Index of ooinoldence— — >46 
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Tliorc fioraiis to bo no doubt tlmi column 6 bolongH to alphabet i and not to alphabot 4. 
This autiiinnticRlIy makes it nccoasoiy that cuiuuw 23 bo assign^ to alphabet 4 since we cannot 
huv(> two no. 1 coltirniis in the same iiitcnniption key. 

Let U.S set down what we h.avo: 

> Fiuoiis -i:i 

l-2-3-t-5-C-7-« -0- 10- 11-12-13-1 1 - 15-i(5-17-lK-10-20-ai-2a-23-2'l-25-2ft-a7-2K 

5— 1— 1— 3— S— 4— 3* ^ 1 5— 4r* 3^ 1 — 4* 3— 2’' 1 - 

Wo now proceed to coinparo those HcruciUMss with tho sequences of idphabets within the 
reversed tTiviips oii hypotheses of 5, d, and 7 olpbaJicts. 

We may conedudo, by n short analysis, Uiat a hypotlicsis of flve nlphabete must bo correct, 
because Ihfl distribution of columns as now made, if it bo eorruet (and wo f^ fairly certain of 
our work so far), allows no room for any alphabot fi or 7. This analyms is fairly simple. Let 
us assume a hypothesis of 6 alphabets. 

Consider the two places where alphabet 6 would have to fall: 

“• • 

yiODBX 44 

■ * 

1&-14-16-16-17-1S-18 25-2G-27-28-1-2 

• • • 3- 1 ^ 6— 4r-. • • • • *1 • • • 

According to this diogmin, column 27, isolated os it stands, would have to bear the number 1. 
But we found, already, that edumn Q is column which forme the isolated group containing 
only one number. Hence, a hypothesis of 6 alphabets cannot be correct. . , 

Let us aaeume a hypoihesia of 7 alphabets. Note these two placee once more: 

Pxcras 45 

13-14-l»-ld-17-l&-19 S5-28-27-2S-1-2 

... 2-1 4— ..... 2-1—7“ 6 "S* 4 ■ ... 

Here again we would have an isolated column (16), which for aamilar reasons cannot bear 
the number 1. " 

We ore loft, therefore, only tlie hypothous of 5 alphabets. This requires that alphabet' 1 be 
used nine times, and alphabet 2, seven times. Wo have already distributed 6 columns to alphabet 
1, and 6 cohunns to alphabet 2, leo'vmg 3 more columns to be assigned to the former, end 2 more 
to tbo Latter. We may assign columns 27 and 28 immediately to alphabets 2 end 1, zespectivefy. 
The sequence thus becomes: -• 

Fioubb 40 

l-2-3-4-6-«-7-«-0-10-ll-12-13-14-l£-10-17-18-12-20-21-22-42-24-26-32-27-2S 

S-4-3-2-1-1-3-3-1- 5- 4- 3-2-1 5-4-3-2-l-4r-3-2- 1-2-1 

5 1 3 5 6 4 2 

If WO molco a grouping of columns to correspond with tho probaUe interruption key, as 
sliov/n in figure 46, wo note that wo have 2 extra groups of 5 and no group of C and 7, respec- 
tively, ns required by a key of tlio dotonninod lougth. If, however, we shift tlie sequence so os 
to bring columns 27 and 28 to tlie loft, whidi is perfectly legitimate since wo are dealing with 
a cyclic key, wo have: 

Fioube 47 

27-23-1-2-3-4-5-6-7-8-9-10-11-12-13-14-15-16-17-18-19-30-31-23-23-24-25-20 
2- 1-5-4-3-3-1-1-8-2-1- 5-4-3- 2-1- 5-4-3-3-1-4-8-2-1 

7 18 C 5 4* - 
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Wp now Imvc our groi>|> of 7 columns, im'r l;iok tlic 1 of C nnd 1 of 2; but we have 2 groups 
of 5 columns. Obviously, tiio only thing to do is ix> Insert tho sequence 2-1—1— in its proper 
piece ns follows: 

' Firuub 4S 

27-23-l-a^-4-5-C-7-8-0-10-ll-12-l^l-t-15-lG-17-l»-10-20-21-22-23-2^25-20 
2- l-n5-4-3-2-l-l-»-2-l- 5- 4- 3- 2- 1- 2- 1- 1- fr- 4- 3- 2- 1- 4- 3- 2- 1 
7 1 3 5 2 U 4 

Wo have thus reconstructed the complete iiitcrnipuon key, and may now proceed to tran- 
scribe tlie message in groups in accordance tlicrcwith. Since the first group contains 7 letters, 
and since the letter S, willi which wo atei*tcd our first transcription (^. 34), is labelled 1, it is 
dear that vto must start our second transcription with the two letters preceding S, i.e., the 
twenty-fifth and twenty-sixth letters of the cipher message. Thus: 

Ficubb 49 

7 13 S 3 6 -i- 

PMSIEET P OKU LISYZ HW BRESAT HRKZ 

R & M N J G K D QKV VMFOB KE NEIHAE A A U E 

etc., etc. 

After this transcription a third sot-up is necessary, in which the sequences of letters ore 
reversed within groups, in order to bring tlie cipher letters back into the original arrangement 
before transposition. The rearranged set-up is os follows: 

Fioube 50 









7 








1 




3 








5 






4 


2 






6 








4 


i 




1 


3 


3 


4 


3 


1 


3 


1 


1 


3 


3 


1 


3 


a 


4 


a 


1 


3 


1 


8 


3 


4 


3 


1 


1 


3 


3 


4 


T 


E 


E 


I 


s 


M 


P 


P 


U 


K 


D 


Z 


Y 


S 


I 


L 


w 


H 


T 


A 


s 


E 


R 


B 


Z 


X 


R H 


K 


G 


J 


N 


M 


G 


R 


0 


V 


K 


Q 


B 


Q 


F 


M 


V 


E 


K 


B 


A H 


I 


E 


N 


E 


If 


A 


A 


R 


X 


W 


L 


F 


H 


K 


K 


K 


0 


E 


W 


u 


F 


H 


H 


F 


A 


E 


P 


s 


E 


W 


T 


G 


D 


D 


B 


D 


G 


X 


P 


J 


P 


W 


Z 


U 


W 


V 


Y 


A 


Z 


L 


X 


E 


U 


I 


A 


H 


L 


I 


N 


E 


A 


U 


U 


K 


W 


D 

* 


E 


R 


B 


A 


F 


J 


J 


V 


M 


E 


P 


I 


K 


N 


E 


Z 


Y 


V 


B 


S 


F 


M 


D 


X 


W 


0 


0 


D 


0 


L 


V 


X 


B 


B 


H 


F 


Q 


0 


W 


Y 


B 


V 


Y 


G 


D 


X 


I 


V 


X 


E 


H 


X 


I 


K 


ff 


I 


E 


E 


N 


0 


W 


E 


A 


P 


I 


N 


R 


C 


L 


N 


H 


N 


Q 


D 


0 


u 


N 


0 


A 


X 


I 


C 


T 


Z 


0 


E 


K 


J 


F 


J 


0 


W 


z 


W 


D 


0 


J 


A 


U 


K 


W 


D 


Z 


s 


E 


B 


X 


X 


G 


D 


G 


X 


N 


X 


T 


D 


I 


K 


T H 


D 


G 


o 

4 


N 


S 


R 


U 


N 


0 


D 


11 


F 


Q 


Z 


A 


z 


A 


E 


H 


R 


L 


c 


W 


If 


0 


W 


Z 


F 


Z 


K 


V 


H 


F 


G 


Y 


K 


P 


N 


P 


V 


E 


H 


A 


Q 


A 


U 


K 


K 


C 


c 


D 


M 


G 


W 


J 


D 


B 


A 


I 


L 


I 


C 


E 


F 


P 


S 


G 


S 


E 


0 


X 


p 


E 


D 


0 


H 


K 


s 


I 


P 


N 


K 


M 


D 


Q 


Y 


s 


I 


F 


Z 


M 


K 


Q 


Q 


I 


G 


X 


G 


Y 


w 


T 


W 


N 


K 


p 


N 


W 


J 


B 


U 


E 


H 


z 


Q 


Y 


J 


L 


w 


Z 


N 


W 


W 


L 


u 


I 


D 


C 


u 


J 


G 


D 


F 


K 


X 


Q 


Q 


W 


F 


M 


P 


B 


E 


I 


K 


S 


c 


C 


D 


u 


F 


L 


X 


D 


0 


D 


0 


E 


F 


K 


M 


H 


K 


N 


A 


Z 


P 


C 


T 


D 


W 


P 


P 


U 


V 


A 


R 


X 


X 


P 


M 


L 


F 


X 


J 


0 


0 


A 


R 


P 


V 


M 


N 


E 


0 


J 


W 


R 


W 


V 


V 


J 


D 


U 


W 


Y 


F 


A 


w 


E 


N 


E 


X 


H 


N 


R 


P 


A 


N 


E 


M 


P 


U 


c 


V 


F 


Y 


H 


L 


c 


N 


M 


N 


P 


V 


H 


X 


L 


F 


X 


J 


0 


0 


A 


R 


L 


n 

KJ 


W 


W 


0 


V 


z 


W 


R 


W 


V 


V 


J 


D 


II 


W 


Y 


F 


A 


w 


E 


N 


E 


X 


0 


P 


W 


V 


K 


P 


Z 


G 


X 


U 


N 


U 


G 


H 


Y 


A 


H 


V 


X 


F 


X 


J 


0 


K 


N 


Z 


B 


V 


S 


N 


0 


I 


E 


L 


K 


R 


G 


V 


N 


H 


G 


H 


D 


T 


F 


J 


B 


ff 


0 


P 


I 


L 


0 


W 


P 


p 


N 


D 


U 


o 

* 


E 


R 


N 


K 


W 


E 


Y 


X 


N 


0 


Z 


N 


X 


E 


U 


Q 


u 


W 


P 


0 


V 


N 


J 


Q Q 


W 


J 


0. 


F 


V 


P 


M 


N 


0 


K 


K 


W 


0 


P 


I 


L 


0 


K 


M 


Y 


F 


H 


B 


V 


D 


M 


P 


E 


5 


A 


? 


K 
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F 
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E 


M 


D 


F 


E 


U 


W 


H 


K 


D 


0 


D 


Y 


S 


I 


L 


R 


X 


Z 


V 


X 


K 


X 


Q 


I 


« * 

Jn 


K 


W 


T 


N 


Y 


H 


Q 


E 


H 


V 


S 


A 


A 


Z 


P 


X 


I 


V 


E 


I 


X 


Q 


T 


L 


A 


rp 


T 


F 


M 


A 


0 


K 


J 


V 


E 


U 


D 


H 


V 


C 


T 


R 


W 


V H 11 


L 


N U 


F 


H 


E 


K 


K 


C 


p 


N 


G 


S 


E 


Y 


E 


W 


W 


P 


U 


J 


C 


B 
























REF ID : A64722 



78 



Vi'i- siro uuw cuiifroutcd witii a ratlicr fdtuplo case of tlio onalj’Bis of five rteiproeal and 
inlcnvIntoJ alplinbcts. They are oompoaod of tho consolidated froquen^ taUcs applying to 
the columns which belong in the some alphabets and are as follows: 

Tabu XXIV 
X 

ColumiM 26— 28 



ABOSSFaBI TXX.UBOB 



^ ^ S ^ 
g g ^ 
g 



g 



g " 



O B B 

g g 



•s V V vr 
g g g g 



Y 2 

" g 



Coluimu 4—6—16—16—21—26—27 



A B C S S 

g 5 s g g 



7 a H X T 



ZX.UVOPQB 8 



T B V W. X Y 2 



ABCBS7GHZ 



3 

Ooluinns 3—7—18-26—24 
JX£ 6 C 2 r 07 QB 



g 



g ■=== 



T IT V W X Y 

= =S g g g - 



ABCDS70BI 

g " =5: g =5 g g 



■ CIoluiuiM 2 — 11—10—23 
J 2 CZ.I 2 K 07 QB 

S g = g g g " 



g 



T u V w 3 : r 



Cdiuniui 1 — 10—18 



ABC 

r- 

*«■ 



OSPOHI jrLJUCXO 

^g- ^gggg=i'' 



7QKSTVVWXY2 

5l g g ^ § 



It v/ill bo uniioccssniy in this paper to discuss the method of deciphering tlie message by an 
analysis of tJicso livo singlo-froqucncy tables. Suflico it to indicate tlie values obtained from the 
dociphenucnc. Tliey are £pvcn boiow, whoru tlio values obtained from a Imowltidgc of the 
reciprocal relation are placed- within poronthoses. ■ Only four values remain unknown, all in 
alphabet 5. 
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Taih-is XXV 


























(1) 


.. A 
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0 
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R S 
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*r 

I 
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Y 


Z 




K 


L 
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N 


R T 


V 


V 


T 


A B 


C 


E 


D 


D 


X 


F Z 


G 


P 


H 


I 


(Q) 


J 


8 


(2) 


.. A 


B 
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D 


E 


F G 


H 


I 


J 


X L 


E 


K 


0 


P 


Q 


R S 


T 


U 


7 


V 


X 


Y 


Z 




E 


X 


P 


H 


A 


(Q) G 


D 


(K) 


L 


I (J) 


S 


0 


N 


c 


F 


W (X) 


Y 


G 


(Z) 


R 


B 


T 


V 


<8) 


.. A 


B 


C 


0 


E 


F C 


H 


I 


J 


X L 


U 


K 


0 


p 


Q 


R S 


T 


U 


V 


Y 


X 


Y 


Z 




W 


(Y)(Z) B 


D 


I {jy s 


F 


G 


M (Q) 


u 


X 


(V) 


T 


L 


X H 


P 


U 


0 


A 


R 


B 


c 


(4) 


.. A 


B 


c 


0 


E 


F G 


H 


I 


J 


K L 


u 


N 


0 


P 


Q 


R S 


T 


u 


V 


W 


X 


Y 


z 




0 


F 


6 (Z) 


Z 


B C 


N 


E 


X 


R T 


V 


H 


A 


S 


I 


(X) P 


L 


Y 


H (QXJ) 


U 


0 


(5) 


.. A 


B 


C 


D 


E 


F G 


H 


I 


J 


X L 


u 


N 


0 


P 


Q 


R S 


T 


U 


V 


Y 


X 


Y 


z 




(F) 


G 


H (K) 


V 


A B (C) Z 


P 


D R 


T 


17 


S (J> 




Z. 0 (HI 




E 


H 


z 







Wo may now attempt a roconstruction of tlic ori^nal, or pziniacy, -alphabet» of wliieh these 
are socondaiies. Note the following Taluos: 



In alphabet 1, E«=N 
In alphabet 2, Nb=0 
In alphabet 3, Oe=v 
In alphabet 4, VsslS 
In alphabet 5, IfsT 

Now if E oocuis in the upper half of alphabet 1, the table of alphabete must a ««bnnn 

fiko tbis: 

Alphabet 1 
Alphabet 3 
Alphabet 5 

Let us assume this to be cozrect. In alphabet 3,E will again be in the upper half of the 
alphabet. We have these values: 

In alphabet 3, EaD 
In alphabet 4, D«Z 
In alphabet 5, Z^f 

But to this we may add two mora values since we have the value of E in alphabet 2. ^us: 

In alphabet 1, Ks=A 
In alphabet 2, A»E 
In alphabet 3, Ea=D 
In alphabet 4, D«sZ 
La alphabet 5, 



Alphabet 2 
Alphabet 4 



CC400— M 
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Since tlio upper lino in alphabet 3 ia displaced but one letter to the lig^t as oompared-iiith 
that of alphabet 1, ve have two columns in the table os foUows: - 



^ f E K 
Alphabet 1 W 

f 0 E 

Alphabet 3 | ^ ^ 



Alphabet 5 | 



I Alphabet 2 
|Alphabet 4 



We may continue thus: 



Hence we have this: 



In alphabet 1, 17=1 
In olphabot 2, I=K 
In alphabet 3, K=N 
In alphabet 4, N=H 
In alphabet 5, H=C 



. f E K ff 

IMAIl 
... r 0 E K 
AlpluO»t 3 1 ^ P ^ . 

A)pll«l»ts| , „ g 



The prooesB is veiy simple and easy to continue. Finally we have this: 

,,*,fEKWFXLY 
^ Ihaircibji 

I Alphabet 4 

Rii.v*,iJMZHOEKW/ * 

Alphabets! ^ ^ g ^ ^ ^ 



Alphabet 5 1 ^ y 



^ ^ I Alphabet 2 

J J J.^phabet4 



We may now continue from the sequences given already* Thus in the last line we see the 
sequence . . . S V D Nj in the fourth line, V D N A I R Q. Hence, we may add A I R Q 
to the lost line. The same process applied to the other lines gives us: 



. r E K W 
Alphabet jj I 



S 4 9 C 7 

17 F X L y 



‘INAIRZBJ U CS 

.fOEKWFXLY tl 

Alphabet 3|vDNAIRQBJ. U 

JMZHOEKWFXLY 
Alphabet 6|^uQsyj)jjAIRQBj 



M Z H 0 
C S V D 
tl Z H 

u c s 



I Alphabet 2 
I Alphabet 4 
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Wo may fill iii tho rest Trom tlio nlpliabots lluimnulvca, aad wo have: 

VliiUHS 31 

lS345«TSfli0 11UU 

*iiv /EKWFXLYGPMZHO 

Alpaftbot HuAIRQBJTUCSVDI 

,, ^ rOEKWFXLYGPlIZHJ 2 

Alphabets DNAIRQBJTUCS1,, 

fMZHOEKWFXLYG 
Alphabet s|tucsVDNAIRQBJ 

Taking alphabet 1, a speedy xeoonstruotion of the original leetapglo is at once efleoted. 
Ulus: 

Ftama 53 

EXPORTS 
K L M N Q U V 
W Y Z A B E O 
F G H I J 

Tho keyword is EXPORTS. In confonnity with the agreementa of the qvtam, the indioators 
should be K X Y and indicate the following: 

X, the initial letter of the distorted alphabet; 

X, the fifth letter in alphabet 1, hence five alphabets; 

Y, the seventh letter in alphabet 1, hence seven groups azzanged as foDows: 

E K W F X L Y 

1 3 S 1 « « 7 

The indicators will be after either tho fifth lett^ or the nineteenth (corresponding to the 
' numorLcalvalueof the initial or final letter of tho keyword). We find K X Y after the nineteenth 
letter. 

Wo may now proceed to decipher the first few groups of the message and the entire solution 
is at hand. It is as follows: 



1 


3 






5 


2 


6 




4 


7 


1 


1 2 3 


1 


3 


3 4 S 


1 2 1 


3 3 4 8 1 


1 3 


3 4 


1 3 3 4 8 1 3 


K 


H N V 


L 


I 


Q K G 


K N N 


H K V Q E 


E X(X X Y)X 0 


P M S I E E T 


K 


V N H 


G 


K 


Q I L 


N K E 


Q V X H N 


0 X 


X E 


T E E I S U P 


A 


H 0 S 


T 


I 


L E R 


E I N 


FORCE 


D B 


R I 


G A D E 0 C C 










1 


3 


5 
















1 


1 3.8 


1 3 3 4 5 
















P 


D K U 


L I S Y Z 


• ■ 


• • • 












P 


U K D 


Z Y S I L 


• • 


• ■ • 












U 


PIE 


S T H E R 


• • 


• • ■ 





‘‘A hostile rrinforeed brigade occupies the . . .” 



REF ID : A64722 



82 

2. Solution of a series of messages.- -TIio sitluiion of a sorios of messages iu this cipher 
pre»oiiis iiii intprcb«liiig demonstration of dio fact that in ctyptograpliy it is often the more 
insignificant details of a system that enable Uic eiyptonolyst to solve the enemy’s messages 
rather than any dednito weakness of the method from the cryptographic point of view. In 
this case, tlio solution of a series of messages based upon the same keyword, but involving 
all tho manifold modifleations of alphabets and interraption key, can be achieved without 
attempting tho dociphonneut of a single message. The method involves merely on analysis 
of tho indicators for a sorios of messages, resulting in a direct and speedy roconstruotion of the 
various generating rectenglos derived from tho some keyword. 

In the first place, it may be pointed out at once that the various generating rectangles 
based upon tho some keyword consist of two parts: A constant sequence, enmaatin g of the key- 
vord pnper, making up tho first line of the rectangle, and a variable, or revolving sequence, 
consisting of the rertioining letters of the alphabet, or os we shall tenn it, the “retidual Mqusncs”, 
nmking up tho remaining linos of tho rectangle. 

In tlio second placo, tlio length of tho interruption kqy for each message can be determined 
by applying tho principlae of coineidence os explained on page 68. Each message is then 
accompanied by the number thus found. 

Given the bogiimmga of a series of IS messages, the lengths of whose interruption keys 
have been detonmnod and ore as indicated below, lot us proceed to an analyos of lengths, 
whidi in a short tune will lead to a direct reconstruction of the kqyword. 

Momc* 

1 K I B I Y 

2 P 0 N Q D 

3 Q Z M V R 

4 0 Z A P N 

5 A A Z S L 

C F U Y 0 T 

7 A T N L B 

8 D Z N D J 

0 V R A Y Q 

10 I A B 0 Q 

11 C T T 0 S 

12 Y L P A Y 

13 D T X D R 

14 G J E T M 

15 S B U X L 

16 I B U R T 

17 0 Y Z V A 

18 T M R G V 

Lot us now turn our attention to the generating roctongdos that are possible, indice (iug 
merely their outlines, for keywords of 5 to 10 letters in length, and numbering the n.-xArei 
which will contain the first, second, third, to the ninth letters of the first line of the e’-puabet 



PieuBX S3.— Beginning* qf aertitt ef nttuagte 



F P P L H 
A U N D U 
P X V T W 
V Z U L V 
I G I A X 
U U F E R 
P T R Z T 
U B R U L 
E R H P 0 
K U L D T 
J X V X P 
N L W 0 F 
C G T K T 
J B P P U 
R Q Y L L 
W Y J A H 
R G ,U Y Y 
I E E 0 K 



G Z 0 P B 
B N K Y 0 
H 17 J X U 
U Y H U K 

V H G 17 M 
B B B S F 

Y W 0 B A 
U F F H H 
P V J Y 0 
D E Q D U 
0 A D S F 
G K P 0 S 
C K 0 Z U 
Q T D U F 
A I K U 0 
T H 0 P B 
B G H W U 
H X F U U 



U A 17 F V 
A 17 D R T 
M M E Q Z 

V L J L W 
Q U Z S I 
0 T P P I 

V I K Y H 
17 V R E A 
W U N X Q 
J A U C W 
J J P Y H 
G N N Q 0 
K Y A Z Y 
C T C A L 
L P H Y B 
K L Q C V 
li E K D 5 

V 6 F I Z 



T B N B .1 
17 Q Z N J 
17 A X 6 U 
G V P 0 P 
0 I T Y E 
A Y K N A 
Z U A L X 
A Q T S S 

V A C A W 
A S D K Y 

V L C K H 
C U 0 F K 
T U K N I 
C V Z E D 

V T A K F 
0 G X L G 
M G H F G 
S Z K J T 



F U E B 6 
L M U R 5 
S V H Y 4 
Z G A Y 8 
R G B N 7 

V Q V L 8 

V X R X 8 
Q S 0 N 8 
Q N E U S 
T P A D 7 
P D A U 5 
0 N V F 8 
A N E G 4 
A P U Q‘ 5 
G L H Y 5 
N J X P 6 
N T 0 G 4 
y Y X 0 4 
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o’ inure tlian 10 I''.l{=-»r3, and no interrupr 
v^cipios to ba oluddal.'^d may bo extended by 







Note now that in (A) of fie^uro 54, the seventh letter of line 1 of the dphabet table based 
upon a keyword of 5 lottora will bo the second letter of the keyword proper and, as such,, doe^ 
not change its position. This follows from the method of constructing the alphabets from tho 
generating rectangle. In other words, with a 5-lottcr keyword, no matter what letters be 
chosen as the mitial letters of the various possible generating zectongleB, all numerical keys 
consisting of seven numbers will always bo designated by the same letter in the indicator group,, 
because the first lino of all generating roctsmglcs based upon the some keyword is always the 
same. Since the letter which designates tlio length of the interruption key is the third letter 
of the incicator group for a keyword of 5 letters, ail messages which factor for a key of seven 
numbers must sliow the same letter os the tbh'd dement of the indipator group. Conversely, if 
messages wludi factor for a kpy of seven numbers show a oonstanpy with respect to the ^rd 
dement of the indicator group, then it must follow that the generating rectangle is hosed upon a 
5-letier keyword, and tliat the corresponding indicator letter is the second letter of the key- 
word proper. Similarly, when the third dement of the indicator group is constant in messages 
which facior for a kpy of six numbers, a gonor.ating rectangle based upon a 6-lotter keyword is 
indicated and the corresponding indicator letter b the second letter of the keyword proper. 
When. the iliird dements of iioo sets of indicator groups are constant andthe mterruption keys 
for the cori*csponding sets of messages consist of five and nine numbers, a 7- or S-lotter keyword 
isindicated and the carresponding indicator letters ore the second and third letters of the 
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Tasce XXVI 



' Meaose au:nbcr 


( 

ladliator gmup 




nuUentarBnap 


Ln(tliarioi» 

rapUMkir 


{ 

1 0 




8 


3 


J X U 


4 


11 




5 


IS 


KUO 


6 


14 


D U F 


6 


S 


X T 0 


5 


8 


F U H 


8 


7 


0 B Q 


8 


18 


F IT U 


4 


13 


0 Z U 


4 


5 


G V U 


7 


1 


0 P B 


6 


4 


H 11 X 


8 


16 


0 P B 


6 


17 


H X U 


4 


12 




8 


9 


J y 0 


8 


10 


nn 


7 



Fzom the airoDgemfiiit of the messages in figure 55 and this list we conoLude the following: 

(1) Tlie keyword eonsists of either 9 or 10 lottei's. . 

(2) The letter L is either the initial or final letter of the keyword proper. 

(3) The letters UU form the second and third letters of the k^word proper. 

(4) The letters B, D, F, G, H, J, K, 0, P, Q are not in the keyword proper. 

Wc now toko each indicator group and from ils accompanying data make certain deductions 
with respect to the sequence of letters in the keyword olplmbet. Thus, for example, the indicator 
group B S F, applying to a message, the length of whose interruption key is 8, means that certain 
of the letters within the generating rectangle concerned are os shown in figure 66, where both 
possibilities as regards the length of the k^irwoid are indicated. 



VxaTnU) 50 

(A) (B) = 

isa4<«r«o isssierseio 
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1 


III 1 ■ 




rr 


■■ 


■ 


■ 


m 


m 


E 


1 


11 1 lb 




m 


mm 


■ 


■ 


S 


■ 


na 


i 


IN i 


■ 


■ 


D 


1 


1 


1 


1 



Since there is room for but one letter between B and F, it follows that only one of the inter- 
vening letters, C, D, and E, isleftinthoresidual8equcnco,theothertwobcinginthekeywordproper. 
Now the indicator group D S F shows that D is tlie initial letter for the generating rectangle for 
message 11, and, since a letter which con be an initial letter of an indicator group cannot be a 
letter of the keyword proper, it follows that the order in the residual sequence is B D F, and 
that the letters C and E are in the keyword proper. 

This is confirmed by the indicator group D S F, which accompanying message 11 has an 
interruption kqy consisting of five numbers. Thus: 

Frooss 57 

(A) (B) 
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I 
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t 









D 


TJ 


I 


J 
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u 


lil 
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n 


Dl 


IT 
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Ap:.i:n, the indicator group F M H, applying to message 8, wliosc intotruption key consists 
of cigiit nimibors, slion-s that the sequence must be B D F G H. Thus: 



Fiavnz SS 



(A) 



sm 








B 


!F 


n 


HI 1 j 1 


B 






j 









CB) 

FRF 



1334I67SBU 

li^is 



I ; 

I : 

There is one q>ace vacant between F and K, which must bo occupied by G. 

The conclusions furnished by the other indicator groups are ^ven iii (he diagrams b\-low. 
Tlio question as to whether a 9-lotter or a lO-Iottor key word is involved will bo dcLenninc<l later. 



niK— 8 






1 9 9 



jyo—bIj 



Pi? 



FiGxmE 50 
H (IJ) K 

Since J is proseat as an 
initial letter the Bcquonoa 
must be: B D F Th J K 



J (K L») 0 
Since K is present as on 
initial lcti^Jr, the seuuenco 
must bo:.B b F GH J X 0 



OBQ— bIoRIt 



0 ( ) Q 

Since P foils nntunl^ 
between 0 and (L &e sequ 
must, be: B D F G H JK 



0 P Q 



In the absence of additional indicator groups, with different initiel letters, we cannot 
continue in the same manner and reconstruct the entire sequence. Let us, however, by to 
dotenninc now whether the keyword contains 9 or 10 lettm. With the residual sequence 
as detemiinod above, let us fill in the generating reotan^e for the indicator group D S F, 
according to both assumed kqyword-lmgths. Thus: ............. 



Ficmw 60 



CA) 



(B) 



19S4««78S 



HnE 


I 




IT 


-LJ 

.PJO.: 


ISj i 
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b| ! 


! 1 




i 




i I 



1 


2 


1 


.4 


fi 


c 


7 


a 


a 


10 


1 ■' 
lb 


If 

F 


U 

G' 


H 


7 


r 


s 


i 


IDj 


1 


□ 


— 






— 




1 


1 


1 


I 



Now the. inlerniplion key for tliu iudicator gnm]> D S:F conaisls of five numbers. The 
sysLoiu is siii’.h that tlio number of aliiliiiluMa employed in n mcss:ige must bn equal to or less 
ilinii the leiigdi of the interruption key; it eannut be more tlum this length. Hence, the letter 
S of the iiiilie^itor group D S F must iudicato a miinbor of ulpliabots equal to five, or loss than 
five. Upon the basis of a 10-lettor keyword, tlio number of alphabeta indicated by the letter S 
would be iinpossiblo; but upon the basis of a 9-lctter keyword, the number of olihabots would 
be three, which is vmy probable. We may tontativdy consito it es eateUidied that the key- 
word is 9 letters in length. 
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NkttP now Jin>>;raiii (A) of figure GO. Tlicre are six spnccs vacant at the end of tlio koy- 
v'onl alpluihet. Tlic loltoi-s from S to B in llic normal olpluibot aru TUVVXYZA, a 
total o: oidit. Xon- we know tli.at U is in flip kcyivonl proper so tiiat the residual sequence 
dops not contain this letter. Of the letters, V W X Y Z A, tlie one most likely to be present in 
the keyword proper is A, leaving the sequence V W X Y Z os the end of the kqy-word alphabet. 

The ki^'word proper consists of the letters not present in the residual sequence. It must, 
thonefore, consist of Uie letters A, C, E, I, L, 21, N, R, and U. We know fhat the letters U II fonn 
the second and third letters of the word; and the position of the indicator groups in the cipher 
text mokes L very probable as the initial or fiiiiU letter of the word. Now, very few words begin- 
ning with L U M and containing the other lettcra A, C, E, I, N, and R can be found. But if we 
assume L to be the final letter of the keyword, the most proba^ ending would be C A L. Given 
. U U . . .CAL, the word NUMERICAL soon suggests itsdf. 

With ilie keyword at hand, every generating rectangle can be constructed at once, and the 
messages may now be deciphered os rapidly as by the l<^timate recipient. 

The decipherment of message 1 follows herewith: 

FiauBB 01 



KIBIY PPPLH GZ Q P B MAWFV TBNBI FUEB... 



Alplmbot 1 



Alphabet 3 
'Alphabet 5 



fNOZUPBHQDESFR 
It G I V H C W J A X K L Y| 
fRNOZUPBMQDESFj 
1 l Y T G I V H C W J A X K1 
fESFRNOZUPBMQDJ ^ 

Ijaxklytgivhgw 



Generating leotongle 
NUMERICAL 

opqstvwxy 

ZBDFGHJK 
Interruption key 
N 0 Z U P B 

t S S • 4 1 



2 3 6 



5 4 12 



IS ISI 1SS4S4 XSS4( 1SS4 1 IS 



K I 


B I Y 


F P P L H G 


Z M A W F 


V T B N 


B 


IF.. 


I K 


Y I B 


G H L P P F 


F W A M Z 


N B T V 


B 


F I 


Z E 


R 0 H 


0 U R W I L 


L B E A T 


T W 0 0 


C 


L 0 


Zero hour will be at two o’clock . 


ft 









The fatal defect in this system lies in tlie fact that a key is used which introduces a frequently 
repeated cycle within a message. The dctoiruination of the length of this qycle, and its recon- 
struction by means of a comparison of alphabets based upon the index of coincideDce, enables a 
speedy solution to bo attained. The insertion of the key indicators within messages makes tltc 
rcconsfructioii of the keywonl and the coxiscquciit solution of a series of messages very easy. 
The many details involved in oiiciphonnent, and decipherment, concomitants of an attempt to 
make the entire operations dependent upon the knowledge of a single keyword and the ease 
witli which a solution may be odiioved in tlto case of a single message or a series of messages mokes 
this cipher unsafe for use iu eitlior the field or the more important operations of the larger head- 
quarters in the rear. 
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